On Sep 10, 2012, at 8:12 AM, Aleksey Senin <aleks...@dev.mellanox.co.il> wrote:

> On 10/09/2012 15:41, Siegmar Gross wrote:
>> Hi,
>> 
>> I have built openmpi-1.6.2rc1 and get the following error.
>> 
>> tyr small_prog 123 mpicc -showme
>> cc -I/usr/local/openmpi-1.6.2_32_cc/include -mt
>>   -L/usr/local/openmpi-1.6.2_32_cc/lib -lmpi -lm -lkstat -llgrp
>>   -lsocket -lnsl -lrt -lm
>> tyr small_prog 124 mpiexec -np 2 -host tyr init_finalize
>> 
>> Hello!
>> Hello!
>> 
>> tyr small_prog 125 mpiexec -np 2 -host sunpc4 init_finalize
>> key_from_blob: remaining bytes in key blob 81
>> 
>> Hello!
>> Hello!
>> 
>> tyr small_prog 126 mpiexec -np 2 -host tyr,sunpc4 init_finalize
>> [tyr:23956] *** Process received signal ***
>> [tyr:23956] Signal: Segmentation Fault (11)
>> [tyr:23956] Signal code: Address not mapped (1)
>> [tyr:23956] Failing at address: 18
>> /.../openmpi-1.6.2_32_cc/lib/libopen-rte.so.4.0.0:0x15434c
>> /lib/libc.so.1:0xcad04
>> /lib/libc.so.1:0xbf3b4
>> /lib/libc.so.1:0xbf59c
>> /.../openmpi-1.6.2_32_cc/lib/libopen-rte.so.4.0.0:orte_rmaps_base_get_target_nodes+0x1cc
>>  [ Signal 11 (SEGV)]
>> /.../openmpi-1.6.2_32_cc/lib/openmpi/mca_rmaps_round_robin.so:0x1ec8
>> /.../openmpi-1.6.2_32_cc/lib/libopen-rte.so.4.0.0:orte_rmaps_base_map_job+0xe4
>> /.../openmpi-1.6.2_32_cc/lib/libopen-rte.so.4.0.0:orte_plm_base_setup_job+0xc4
>> /.../openmpi-1.6.2_32_cc/lib/openmpi/mca_plm_rsh.so:orte_plm_rsh_launch+0x1b0
>> /.../openmpi-1.6.2_32_cc/bin/orterun:orterun+0x16a8
>> /.../openmpi-1.6.2_32_cc/bin/orterun:main+0x24
>> /.../openmpi-1.6.2_32_cc/bin/orterun:_start+0xd8
>> [tyr:23956] *** End of error message ***
>> Segmentation fault
>> 
>> Do you have any ideas or suggestions? As I wrote in yesterday's
>> email, I had to add "#include <math.h>" to the file
>> openmpi-1.6.2rc1/ompi/contrib/vt/vt/extlib/otf/tools/otfaux/otfaux.cpp
>> to get a prototype for the function "rint" used in line 834. Thank you
>> very much in advance for any help.
>> 
>> 
>> Kind regards
>> 
>> Siegmar
>> 
>> 
>> _______________________________________________
>> users mailing list
>> us...@open-mpi.org
>> http://www.open-mpi.org/mailman/listinfo.cgi/users
>> 
> 
> Did you compile OMPI using the '--with-pmi' option? I have seen this error
> when a job was allocated on one console but the test was run on another.
> Try running the task on the console where you allocated it.

Really? That shouldn't happen - I'll take a look at that one.

> By the way, is there any way to disable SLURM usage even if OMPI was
> compiled with it? If so, what is the option?

Do you mean "ignore the allocation"? Or do you mean "launch the job without 
using slurm"? The latter is simple - just "-mca plm rsh" will do the trick.
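
As a rough sketch of the second case, assuming the hosts and test program from
Siegmar's report (tyr, sunpc4, init_finalize) purely for illustration, the rsh
launcher can be selected either on the command line or through the equivalent
OMPI_MCA_ environment variable:

```shell
# Select the rsh/ssh launcher for this run only.
# Host names and program name are taken from the report above as an example.
mpiexec -mca plm rsh -np 2 -host tyr,sunpc4 init_finalize

# Alternative: set the same MCA parameter through the environment,
# so that it applies to every mpiexec started from this shell.
export OMPI_MCA_plm=rsh
mpiexec -np 2 -host tyr,sunpc4 init_finalize
```

Both forms set the same "plm" parameter; the command-line form takes
precedence if both are given.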

