Re: [OMPI users] Bus Error in ompi_free_list_grow

2008-11-14 Thread Jeff Squyres
If this is the problem, that's good -- it just means that we need a better error detection in the case where we run out of memory, etc. Stay tuned to that thread to see what happens. On Nov 14, 2008, at 1:14 PM, Peter Cebull wrote: Jeff Squyres wrote: Could this issue actually be related

Re: [OMPI users] Bus Error in ompi_free_list_grow

2008-11-14 Thread Peter Cebull
Jeff Squyres wrote: Could this issue actually be related to: http://www.open-mpi.org/community/lists/devel/2008/11/4882.php (read through the thread to get to the error handling stuff) You might be right that this issue is the problem. Our system has diskless nodes, so /tmp uses a ramdisk.

Re: [OMPI users] Bus Error in ompi_free_list_grow

2008-11-14 Thread Jeff Squyres
Could this issue actually be related to: http://www.open-mpi.org/community/lists/devel/2008/11/4882.php (read through the thread to get to the error handling stuff) On Nov 14, 2008, at 7:41 AM, Geraldo Veiga wrote: Thanks Peter. Blocking the shared memory layer did the trick for our p

Re: [OMPI users] Bus Error in ompi_free_list_grow

2008-11-14 Thread Geraldo Veiga
Thanks Peter. Blocking the shared memory layer did the trick for our program too. For the record, we also have SGI Propack 6 installed (sgi-propack-release-6-sgi600r3). Is the on-node shared memory support completely blocked? What if the MPI process calls a procedure that uses OpenMP threads, f

Re: [OMPI users] Bus Error in ompi_free_list_grow

2008-11-13 Thread Peter Cebull
Geraldo, The previous message you saw was for our Altix ICE system. Since we started seeing these errors after upgrading to SGI Propack 6, I wonder if there's a bug somewhere in the Propack software or an incompatibility between Open MPI and OFED 1.3 (we had no problems under OFED 1.2). A wor

[OMPI users] Bus Error in ompi_free_list_grow

2008-11-13 Thread Geraldo Veiga
Hi to all, I am using the same subject of a recent message I found in the list archives of this mailing list: http://www.open-mpi.org/community/lists/users/2008/10/7025.php There was no follow-up on that one, but will add this similar report in case a list member can give us an idea of how to c

[OMPI users] Bus Error in ompi_free_list_grow

2008-10-17 Thread Allen Barnett
Hi: A customer is running our parallel application on an SGI Altix machine. They compiled OMPI 1.2.8 themselves. The Altix uses IB interfaces and they recently upgraded to OFED 1.3 (in SGI Propack 6). They are receiving a bus error in ompi_free_list_grow: [r1i0n0:01321] *** Process received signal