Is there any chance you can update to Open MPI 1.4.2?
On Sep 11, 2010, at 9:35 AM, Srikanth Raju wrote: > Hello OMPI Users, > I'm using OpenMPI 1.4.1 with gcc 4.4.3 on my x86_64 linux system running the > latest Ubuntu 10.04 distro. I don't seem to be able to run any OpenMPI > application. I try running the simplest application, which goes like this > > #include<mpi.h> > int main(int argc, char * argv[]) > { > MPI_Init(NULL, NULL); > MPI_Finalize(); > } > > Compiling it with "mpicc -g test.c" > Running with "mpirun -n 2 -hostfile hosts a.out" > hosts file contains "localhost slots=2" > On run, I get this > > > [starbuck:18829] *** Process received signal *** > [starbuck:18830] *** Process received signal *** > [starbuck:18830] Signal: Segmentation fault (11) > [starbuck:18830] Signal code: Address not mapped (1) > [starbuck:18830] Failing at address: 0x3c > [starbuck:18829] Signal: Segmentation fault (11) > [starbuck:18829] Signal code: Address not mapped (1) > [starbuck:18829] Failing at address: 0x3c > [starbuck:18830] [ 0] /lib/libpthread.so.0(+0xf8f0) [0x7f3b0aae08f0] > [starbuck:18830] [ 1] /usr/local/lib/libmca_common_sm.so.1(+0x1561) > [0x7f3b082e8561] > [starbuck:18830] [ 2] > /usr/local/lib/libmca_common_sm.so.1(mca_common_sm_mmap_init+0x6c1) > [0x7f3b082e9137] > [starbuck:18830] [ 3] /usr/lib/openmpi/lib/openmpi/mca_mpool_sm.so(+0x137b) > [0x7f3b084ed37b] > [starbuck:18830] [ 4] /usr/lib/libmpi.so.0(mca_mpool_base_module_create+0x7d) > [0x7f3b0bacc38d] > [starbuck:18830] [ 5] /usr/lib/openmpi/lib/openmpi/mca_btl_sm.so(+0x2a38) > [0x7f3b06c52a38] > [starbuck:18830] [ 6] /usr/lib/openmpi/lib/openmpi/mca_bml_r2.so(+0x18e7) > [0x7f3b076a48e7] > [starbuck:18830] [ 7] /usr/lib/openmpi/lib/openmpi/mca_pml_ob1.so(+0x258c) > [0x7f3b07aae58c] > [starbuck:18830] [ 8] /usr/lib/libmpi.so.0(+0x392bf) [0x7f3b0ba8b2bf] > [starbuck:18830] [ 9] /usr/lib/libmpi.so.0(MPI_Init+0x170) [0x7f3b0baac330] > [starbuck:18830] [10] a.out(main+0x22) [0x400866] > [starbuck:18830] [11] /lib/libc.so.6(__libc_start_main+0xfd) [0x7f3b0a76cc4d] > [starbuck:18830] [12] a.out() [0x400789] > [starbuck:18830] *** End of error message *** > [starbuck:18829] [ 0] /lib/libpthread.so.0(+0xf8f0) [0x7fb6efefe8f0] > [starbuck:18829] [ 1] /usr/local/lib/libmca_common_sm.so.1(+0x1561) > [0x7fb6ed706561] > [starbuck:18829] [ 2] > /usr/local/lib/libmca_common_sm.so.1(mca_common_sm_mmap_init+0x6c1) > [0x7fb6ed707137] > [starbuck:18829] [ 3] /usr/lib/openmpi/lib/openmpi/mca_mpool_sm.so(+0x137b) > [0x7fb6ed90b37b] > [starbuck:18829] [ 4] /usr/lib/libmpi.so.0(mca_mpool_base_module_create+0x7d) > [0x7fb6f0eea38d] > [starbuck:18829] [ 5] /usr/lib/openmpi/lib/openmpi/mca_btl_sm.so(+0x2a38) > [0x7fb6ec070a38] > [starbuck:18829] [ 6] /usr/lib/openmpi/lib/openmpi/mca_bml_r2.so(+0x18e7) > [0x7fb6ecac28e7] > [starbuck:18829] [ 7] /usr/lib/openmpi/lib/openmpi/mca_pml_ob1.so(+0x258c) > [0x7fb6ececc58c] > [starbuck:18829] [ 8] /usr/lib/libmpi.so.0(+0x392bf) [0x7fb6f0ea92bf] > [starbuck:18829] [ 9] /usr/lib/libmpi.so.0(MPI_Init+0x170) [0x7fb6f0eca330] > [starbuck:18829] [10] a.out(main+0x22) [0x400866] > [starbuck:18829] [11] /lib/libc.so.6(__libc_start_main+0xfd) [0x7fb6efb8ac4d] > [starbuck:18829] [12] a.out() [0x400789] > [starbuck:18829] *** End of error message *** > -------------------------------------------------------------------------- > mpirun noticed that process rank 1 with PID 18830 on node starbuck exited on > signal 11 (Segmentation fault). > -------------------------------------------------------------------------- > > My stack trace from gdb is: > > Program received signal SIGSEGV, Segmentation fault. > 0x00007ffff43c2561 in opal_list_get_first (list=0x7ffff45c5240) > at ../../../../../opal/class/opal_list.h:201 > 201 assert(1 == item->opal_list_item_refcount); > (gdb) bt > #0 0x00007ffff43c2561 in opal_list_get_first (list=0x7ffff45c5240) > at ../../../../../opal/class/opal_list.h:201 > #1 0x00007ffff43c3137 in mca_common_sm_mmap_init (procs=0x673cb0, > num_procs=2, size=67113040, > file_name=0x673c40 > "/tmp/openmpi-sessions-srikanth@starbuck_0/1510/1/shared_mem_pool.starbuck", > size_ctl_structure=4176, data_seg_alignment=8) > at ../../../../../ompi/mca/common/sm/common_sm_mmap.c:291 > #2 0x00007ffff45c737b in mca_mpool_sm_init (resources=<value optimized out>) > at ../../../../../../ompi/mca/mpool/sm/mpool_sm_component.c:214 > #3 0x00007ffff7ba638d in mca_mpool_base_module_create () > from /usr/lib/libmpi.so.0 > #4 0x00007ffff2d2ca38 in sm_btl_first_time_init (btl=<value optimized out>, > nprocs=<value optimized out>, procs=<value optimized out>, > peers=<value optimized out>, reachability=<value optimized out>) > at ../../../../../../ompi/mca/btl/sm/btl_sm.c:228 > #5 mca_btl_sm_add_procs (btl=<value optimized out>, > nprocs=<value optimized out>, procs=<value optimized out>, > peers=<value optimized out>, reachability=<value optimized out>) > at ../../../../../../ompi/mca/btl/sm/btl_sm.c:500 > #6 0x00007ffff377e8e7 in mca_bml_r2_add_procs (nprocs=<value optimized out>, > procs=0x2, reachable=0x7fffffffdd00) > at ../../../../../../ompi/mca/bml/r2/bml_r2.c:206 > #7 0x00007ffff3b8858c in mca_pml_ob1_add_procs (procs=0x678ce0, nprocs=2) > ---Type <return> to continue, or q <return> to quit--- > at ../../../../../../ompi/mca/pml/ob1/pml_ob1.c:315 > #8 0x00007ffff7b652bf in ?? () from /usr/lib/libmpi.so.0 > #9 0x00007ffff7b86330 in PMPI_Init () from /usr/lib/libmpi.so.0 > #10 0x0000000000400866 in main (argc=1, argv=0x7fffffffe008) > at test.c:4 > > I can't figure out what's going on here! It says MPI_Init is segfaulting, but > I think it is probably some kind of misconfiguration. > I have tried reinstalling the openmpi package. I have an AMD Turion X2 > M500(64 bit) processor. > > The interesting thing is, the Segfault occurs only when I try to run multiple > processes. With n = 1, it has no problems. > Thanks for any help! > > -- > Regards, > Srikanth Raju > _______________________________________________ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/