On Jul 1, 2009, at 3:10 PM, Daniel Miles wrote:
Hi, everybody.
I’m having trouble where one of my client nodes crashes while I have
an MPI job on it. When this happens, the mpirun process on the head
node never returns.
This shouldn't happen - we should cleanly abort. What version are yo
Hi, everybody.
I’m having trouble where one of my client nodes crashes while I have an MPI
job on it. When this happens, the mpirun process on the head node never
returns. I can kill it with a SIGINT (Ctrl-C), and it still cleans up its
child processes on the remaining healthy client nodes, but I do