Husen R <hus...@gmail.com> writes:

> Dear Open MPI Users,
>
> Does the current stable release of Open MPI (v1.10 series) support fault
> tolerant feature ?
> I got the information from Open MPI FAQ that The checkpoint/restart support
> was last released as part of the v1.6 series.
> I just want to make sure about this.

Orthogonal to Jeff's comments: DMTCP <http://dmtcp.sourceforge.net/> is advertised as able to checkpoint OMPI, at least over TCP and IB (for some value of "IB"). Does anyone here have experience with that?