Hi all,How can I create ckpt files regularly? I mean, do checkpoint every
100 seconds. Is there any options to do this? Or I have to write a script
myself?THANKS,---CHEN SongR&D DepartmentNational Supercomputer
Center in TianjinBinhai New Area, Tianjin, China
feature or have a strong use
case for a set of features, then that is important information for the
Open MPI developer community. This will help use as a project
prioritize the maintenance of various features in the Open MPI
project.
Best of luck,
Josh
On Wed, Jun 20, 2012 at 2:59 AM, 陈松 <chens
not needed so we
(read this as the official version of Open MPI) do not support it.However, a
group of researchers have been working toward a version of Open MPI that
supports the last fault tolerance proposal submitted for consideration to the
MPI Forum. You can access it
at https://bitbucket.org/jj
Hi all,Can anyone explain me the fault tolerant features in OpenMPI? I've read
the FAQs and some papers about this topic listed in open-mpi.org, but still
can't figure out when one node of my supercomputer system fails down during
computing, what would happen with the fault tolerant mechanism in