Re: [OMPI users] Disconnections

2009-07-01 Thread Ralph Castain
On Jul 1, 2009, at 3:10 PM, Daniel Miles wrote:

> Hi, everybody. I’m having trouble where one of my client nodes crashes while I have an MPI job on it. When this happens, the mpirun process on the head node never returns.

This shouldn't happen - we should cleanly abort. What version are yo

[OMPI users] Disconnections

2009-07-01 Thread Daniel Miles
Hi, everybody. I’m having trouble where one of my client nodes crashes while I have an MPI job on it. When this happens, the mpirun process on the head node never returns. I can kill it with a SIGINT (ctrl-c) and it still cleans up its child processes on the remaining healthy client nodes but I do