Re: [OMPI users] Bus Error in ompi_free_list_grow

2008-11-14 Thread Jeff Squyres
If this is the problem, that's good -- it just means that we need a better error detection in the case where we run out of memory, etc. Stay tuned to that thread to see what happens. On Nov 14, 2008, at 1:14 PM, Peter Cebull wrote: Jeff Squyres wrote: Could this issue actually be related

Re: [OMPI users] Bus Error in ompi_free_list_grow

2008-11-14 Thread Peter Cebull
Jeff Squyres wrote: Could this issue actually be related to: http://www.open-mpi.org/community/lists/devel/2008/11/4882.php (read through the thread to get to the error handling stuff) You might be right that this issue is the problem. Our system has diskless nodes, so /tmp uses a ramdisk.

Re: [OMPI users] Bus Error in ompi_free_list_grow

2008-11-14 Thread Jeff Squyres
Could this issue actually be related to: http://www.open-mpi.org/community/lists/devel/2008/11/4882.php (read through the thread to get to the error handling stuff) On Nov 14, 2008, at 7:41 AM, Geraldo Veiga wrote: Thanks Peter. Blocking the shared memory layer did the trick for our p

Re: [OMPI users] Bus Error in ompi_free_list_grow

2008-11-14 Thread Geraldo Veiga
Thanks Peter. Blocking the shared memory layer did the trick for our program too. For the record, we also have SGI Propack 6 installed (sgi-propack-release-6-sgi600r3). Is the on-node shared memory support completely blocked? What if the MPI process calls a procedure that uses OpenMP threads, f

Re: [OMPI users] Bus Error in ompi_free_list_grow

2008-11-13 Thread Peter Cebull
Geraldo, The previous message you saw was for our Altix ICE system. Since we started seeing these errors after upgrading to SGI Propack 6, I wonder if there's a bug somewhere in the Propack software or an incompatibility between Open MPI and OFED 1.3 (we had no problems under OFED 1.2). A wor