On Feb 25, 2013, at 10:38 AM, Bokassa <boka...@gmail.com> wrote:

> Hi, 
>    I noticed that MPI_Abort() does not abort the tasks if the mpi program is 
> started using srun.
> I call MPI_Abort() from rank 0, this process exit, but the other ranks keep 
> running or waiting for IO 
> on the other nodes. The only way to kill the job is to use scancel. 
> However if I use mpirun under a slurm allocation then MPI_Abort() works as 
> expected aborting 
> all tasks.
> 
> Is this a known issue?

What version of OMPI are you using? Slurm should detect the process failure and 
kill the job, unless it was configured not to do so.

> 
> Thanks, David
> 
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users


Reply via email to