Re: [OMPI users] Occasional mpirun hang on completion

2008-01-24 Thread Barry Rountree
On Fri, Jan 18, 2008 at 08:33:10PM -0500, Jeff Squyres wrote:
> Barry --
>
> Could you check what apps are still running when it hangs? I.e., I
> assume that all the uptime's are dead; are all the orted's dead on the
> remote nodes? (orted = our helper process that is launched on the
> rem

Re: [OMPI users] Occasional mpirun hang on completion

2008-01-24 Thread Barry Rountree
On Thu, Jan 24, 2008 at 03:01:40AM -0500, Barry Rountree wrote:
> On Fri, Jan 18, 2008 at 08:33:10PM -0500, Jeff Squyres wrote:
> > Barry --
> >
> > Could you check what apps are still running when it hangs? I.e., I
> > assume that all the uptime's are dead; are all the orted's dead on the
>

[OMPI users] (no subject)

2008-01-24 Thread Wong, Wayne
I'm having some difficulty getting the Open MPI checkpoint/restart fault tolerance working. I have compiled Open MPI with the "--with-ft=cr" flag, but when I attempt to run my test program (ring), the ompi-checkpoint command fails. I have verified that the test program works fine without the fault

[OMPI users] openmpi-1.2.4-1/OFED 1.2.5.4 ConnectX MPI_Reduce hang

2008-01-24 Thread Mostyn Lewis
Hello, I have a very simple MPI program hanging in MPI_Reduce using the openmpi-1.2.4-1 as supplied with OFED 1.2.5.4 (running this too). It works with the same hardware using the supplied mvapich (mvapich-0.9.9). The hardware is a Mellanox Technologies MT25418 [ConnectX IB DDR] (rev a0) HCA (SUN/v

Re: [OMPI users] Occasional mpirun hang on completion

2008-01-24 Thread Tim Mattox
Hello Barry, I am guessing you are trying to use a threaded build of Open MPI... Unfortunately, the threading support in Open MPI 1.2.x is not only not well tested, it has many known problems. We do not advise use of threading in the Open MPI 1.2.x series. We even added a warning in version 1.2.

[OMPI users] Topology functions from MPI 1.1

2008-01-24 Thread David Souther
Hello, My name is David Souther, and I am a student working on a parallel processing research project at Rocky Mountain College. We need to attach topology information to our processes, but the assertions we have been making about the MPI Topology mechanism seem to be false. We would like to do s

Re: [OMPI users] Occasional mpirun hang on completion

2008-01-24 Thread Barry Rountree
On Thu, Jan 24, 2008 at 04:03:49PM -0500, Tim Mattox wrote:
> Hello Barry,
> I am guessing you are trying to use a threaded build of Open MPI...
>
> Unfortunately, the threading support in Open MPI 1.2.x is not only not well
> tested, it has many known problems. We do not advise use of threading