Rodrigo,
Open MPI has the ability to migrate a subset of processes (in the trunk -
though currently broken due to recent code movement, I'm slowing developing
the fix in my spare time). The current implementation only checkpoints the
migrating processes, but suspends all other processes during the
I appreciate your help.
Indeed, it's better to create my own mechanism as mentioned Lloyd. Actually
my application is a framework to stream processing (something like IBM
System-S), in which I use Open MPI as communication layer and part of
process management. One of this framework's features is t
Since you're looking for a function call, I'm going to assume that you
are writing this application, and it's not a pre-compiled, commercial
application. Given that, it's going to be significantly better to have
an internal application checkpointing mechanism, where it serializes and
stores the da
Currently Open MPI only supports the checkpointing of the whole
application. There has been some work on uncoordinated checkpointing with
message logging, though I do not know the state of that work with regards
to availability. That work has been undertaken by the University of
Tennessee Knoxville
Hi,
I'd like to know if there is a way to checkpoint a specific process running
under an mpirun call. In other words, is there a function CHECKPOINT(rank)
in which I can pass the rank of the process I want to checkpoint? I do not
want to checkpoint the entire application, but just one of its proce