Re: [OMPI users] Occasional mpirun hang on completion

2008-01-25 Thread Barry Rountree
On Thu, Jan 24, 2008 at 10:09:51PM -0500, Barry Rountree wrote:
> On Thu, Jan 24, 2008 at 04:03:49PM -0500, Tim Mattox wrote:
> > Hello Barry,
> > I am guessing you are trying to use a threaded build of Open MPI...
> >
> > Unfortunately, the threading support in Open MPI 1.2.x is not only not well

Re: [OMPI users] Occasional mpirun hang on completion

2008-01-24 Thread Barry Rountree
On Thu, Jan 24, 2008 at 04:03:49PM -0500, Tim Mattox wrote:
> Hello Barry,
> I am guessing you are trying to use a threaded build of Open MPI...
>
> Unfortunately, the threading support in Open MPI 1.2.x is not only not well
> tested, it has many known problems. We do not advise use of threading

Re: [OMPI users] Occasional mpirun hang on completion

2008-01-24 Thread Tim Mattox
Hello Barry, I am guessing you are trying to use a threaded build of Open MPI... Unfortunately, the threading support in Open MPI 1.2.x is not only not well tested, it has many known problems. We do not advise use of threading in the Open MPI 1.2.x series. We even added a warning in version 1.2.

Re: [OMPI users] Occasional mpirun hang on completion

2008-01-24 Thread Barry Rountree
On Thu, Jan 24, 2008 at 03:01:40AM -0500, Barry Rountree wrote:
> On Fri, Jan 18, 2008 at 08:33:10PM -0500, Jeff Squyres wrote:
> > Barry --
> >
> > Could you check what apps are still running when it hangs? I.e., I
> > assume that all the uptime's are dead; are all the orted's dead on the
>

Re: [OMPI users] Occasional mpirun hang on completion

2008-01-24 Thread Barry Rountree
On Fri, Jan 18, 2008 at 08:33:10PM -0500, Jeff Squyres wrote:
> Barry --
>
> Could you check what apps are still running when it hangs? I.e., I
> assume that all the uptime's are dead; are all the orted's dead on the
> remote nodes? (orted = our helper process that is launched on the
> rem

Re: [OMPI users] Occasional mpirun hang on completion

2008-01-22 Thread Marco Sbrighi
Dear Barry and Jeff, using OpenMPI we are experiencing something like the behaviour reported by Barry. Let me introduce the context: we are using RHEL4 U4 on 2-way, dual-core AMD Opteron nodes. Each node is equipped with 16 GB of RAM, plus 4 GB of swap. OpenMPI is 1.2.2. Sometimes, for jo

Re: [OMPI users] Occasional mpirun hang on completion

2008-01-18 Thread Barry Rountree
On Fri, Jan 18, 2008 at 08:33:10PM -0500, Jeff Squyres wrote:
> Barry --
>
> Could you check what apps are still running when it hangs? I.e., I
> assume that all the uptime's are dead; are all the orted's dead on the
> remote nodes? (orted = our helper process that is launched on the
> rem

Re: [OMPI users] Occasional mpirun hang on completion

2008-01-18 Thread Jeff Squyres
Barry -- Could you check what apps are still running when it hangs? I.e., I assume that all the uptime's are dead; are all the orted's dead on the remote nodes? (orted = our helper process that is launched on the remote nodes to exert process control, funnel I/O back and forth to mpirun
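One possible way to run the check Jeff asks for is sketched below. It is only an illustration, not something posted in the thread: the function names `hosts_in` and `check_orted` are made up here, and it assumes passwordless ssh to the nodes, `pgrep` being installed on them, and a hostfile whose first field on each line is the host name.

```shell
#!/bin/sh
# hosts_in FILE: print the first field of every non-blank, non-comment line.
# (Hypothetical helper; assumes one host per line, e.g. "node1 slots=2".)
hosts_in() {
    awk 'NF && $1 !~ /^#/ { print $1 }' "$1"
}

# check_orted FILE: for each host in FILE, list any orted processes that
# are still alive. (Hypothetical helper; assumes passwordless ssh and pgrep.)
check_orted() {
    for host in $(hosts_in "$1"); do
        echo "== $host"
        ssh "$host" 'pgrep -lx orted || echo "(no orted running)"'
    done
}
```

While mpirun is hung, something like `check_orted ~/hostfile` from another terminal would show which nodes, if any, still have an orted running.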

Re: [OMPI users] Occasional mpirun hang on completion

2008-01-13 Thread Barry Rountree
On Sun, Jan 13, 2008 at 09:54:47AM -0500, Barry Rountree wrote:
> Hello,
>
> The following command
>
> mpirun -np 2 -hostfile ~/hostfile uptime
>
> will occasionally hang after completing. The expected output appears on
> the screen, but mpirun needs a SIGKILL to return to the console.
>
> T

[OMPI users] Occasional mpirun hang on completion

2008-01-13 Thread Barry Rountree
Hello,

The following command

mpirun -np 2 -hostfile ~/hostfile uptime

will occasionally hang after completing. The expected output appears on the screen, but mpirun needs a SIGKILL to return to the console.

This has been verified with OpenMPI v1.2.4 compiled with both icc 9.1 20061101 (aka 9
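Because the hang is only occasional, it may help to measure how often it happens. The following is a rough sketch, not part of the original report: `count_hangs` is a made-up name, and it assumes GNU coreutils `timeout` is available. It runs a command repeatedly under a time limit and counts the runs that had to be terminated.

```shell
#!/bin/sh
# count_hangs RUNS LIMIT CMD...: run CMD RUNS times, each with a LIMIT-second
# time limit, and print how many runs did not finish in time.
# (Hypothetical helper; assumes GNU coreutils timeout.)
count_hangs() {
    runs=$1
    limit=$2
    shift 2
    hangs=0
    i=0
    while [ "$i" -lt "$runs" ]; do
        # timeout sends SIGTERM after $limit seconds, then SIGKILL 2 s later.
        timeout -k 2 "$limit" "$@" >/dev/null 2>&1
        status=$?
        # 124 means the time limit was hit; 137 means SIGKILL was needed.
        if [ "$status" -eq 124 ] || [ "$status" -eq 137 ]; then
            hangs=$((hangs + 1))
        fi
        i=$((i + 1))
    done
    echo "$hangs"
}
```

For the command above, a call such as `count_hangs 100 60 mpirun -np 2 -hostfile ~/hostfile uptime` would print how many of 100 runs exceeded 60 seconds; the run count and time limit are arbitrary choices.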