What exactly happens when a job or node crashes? Does ORTE send a SIGTERM to each process?
Best regards,
Jürgen

Durga Choudhury wrote:
> This would be a very welcome new feature for me as well. My two
> thumbs up when it happens.
>
> Best regards
> Durga
>
> On Tue, Apr 13, 2010 at 10:28 AM, Ralph Castain <r...@open-mpi.org> wrote:
>
>> Not right now, but coming later this year...
>>
>> On Apr 13, 2010, at 7:21 AM, Jürgen Kaiser wrote:
>>
>>> Hi,
>>>
>>> Can I force MPI to not abort the whole job when a node crashes? I would
>>> like to let the remaining MPI processes perform some action in that case
>>> and then proceed.
>>>
>>> Thanks,
>>> Jürgen
>>> _______________________________________________
>>> users mailing list
>>> us...@open-mpi.org
>>> http://www.open-mpi.org/mailman/listinfo.cgi/users