This looks like a problem with the memory allocator. It could be a genuine problem with Open MPI, or it could be a memory fault in your application (that happens to dead-end in one of our libraries because we intercept memory allocation functions). From the stack trace, it looks like you're in the middle of a complex deallocation of some C++ objects, so I really can't tell (i.e., not in an MPI function at all).

Can you do a few things:

- configure your Open MPI --with-memory-manager=none and see if the problem goes away. This tells Open MPI to not intercept memory manager functions, so if you still have the problem, it's more likely to be a problem in your application than in OMPI.

- run your application through a memory-checking debugger (such as valgrind) and see if it identifies any memory faults within your code.

- send the additional information for run-time problems listed on http://www.open-mpi.org/community/help/

Thanks!


On Jan 25, 2006, at 10:39 PM, Glenn Morris wrote:


I tried nightly snapshot 1.1a1r8803 and it said the following. I'm
willing to try and debug this further, but would need some guidance. I
have access to totalview.


Signal:11 info.si_errno:0(Success) si_code:2(SEGV_ACCERR)
Failing at addr:0x97421004
[0] func:/afs/slac.stanford.edu/g/ki/users/gmorris/tmp/ ompi-1.1a1r8803/lib/libopal.so.0 [0x1cc9fa]
[1] func:/lib/tls/libpthread.so.0 [0xfd2f80]
[2] func:/afs/slac.stanford.edu/g/ki/users/gmorris/tmp/ ompi-1.1a1r8803/lib/libopal.so.0(free+0x5e) [0x1cf0a2]
[3] func:./cosmomc(for_deallocate+0x56) [0x80d8806]
[4] func:./cosmomc(for_dealloc_allocatable+0x59) [0x80d886d]
[5] func:./cosmomc(spline_+0x4f2) [0x805a2ea]
[6] func:./cosmomc(cambmain_mp_initsourceinterpolation_+0x243) [0x8089b65] [7] func:/afs/slac.stanford.edu/package/intel_tools/compiler8.1/ i386_linux24/ifort/lib/libguide.so(__kmp_invoke_microtask+0x4d) [0x19f8cd] [8] func:/afs/slac.stanford.edu/package/intel_tools/compiler8.1/ i386_linux24/ifort/lib/libguide.so(__kmpc_invoke_task_func+0xa2) [0x18fea6] [9] func:/afs/slac.stanford.edu/package/intel_tools/compiler8.1/ i386_linux24/ifort/lib/libguide.so(__kmp_internal_fork+0x19b) [0x1900a1] [10] func:/afs/slac.stanford.edu/package/intel_tools/compiler8.1/ i386_linux24/ifort/lib/libguide.so(__kmp_fork_call+0x334) [0x18af18] [11] func:/afs/slac.stanford.edu/package/intel_tools/compiler8.1/ i386_linux24/ifort/lib/libguide.so(__kmpc_fork_call+0x35) [0x19242d] [12] func:./cosmomc(cambmain_mp_initsourceinterpolation_+0xa8) [0x80899ca]
[13] func:./cosmomc(cambmain_mp_cmbmain_+0x8a8) [0x8085c34]
[14] func:./cosmomc(camb_mp_camb_getresults_+0x99) [0x80936db]
[15] func:./cosmomc(camb_mp_camb_gettransfers_+0x117) [0x8093387]
[16] func:./cosmomc(cmb_cls_mp_getcls_+0x102) [0x80aa82a]
[17] func:./cosmomc(calclike_mp_getloglikepost_+0x1f9) [0x80b43e9]
[18] func:./cosmomc(calclike_mp_getloglike_+0x23e) [0x80b41d0]
[19] func:./cosmomc(montecarlo_mp_mcmcsample_+0x130) [0x80b68dc]
[20] func:./cosmomc(MAIN__+0x15a3) [0x80b885b]
[21] func:./cosmomc(main+0x20) [0x8059758]
[22] func:/lib/tls/libc.so.6(__libc_start_main+0xda) [0x23c79a]
[23] func:./cosmomc(sinh+0x49) [0x8059611]
*** End of error message ***
1 process killed (possibly by Open MPI)


_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users


--
{+} Jeff Squyres
{+} The Open MPI Project
{+} http://www.open-mpi.org/


Reply via email to