Re: [OMPI users] Checkpoint an MPI process

2012-01-20 Thread Josh Hursey
Rodrigo, Open MPI has the ability to migrate a subset of processes (in the trunk - though currently broken due to recent code movement, I'm slowing developing the fix in my spare time). The current implementation only checkpoints the migrating processes, but suspends all other processes during the

Re: [OMPI users] Checkpoint an MPI process

2012-01-20 Thread Rodrigo Oliveira
I appreciate your help. Indeed, it's better to create my own mechanism as mentioned Lloyd. Actually my application is a framework to stream processing (something like IBM System-S), in which I use Open MPI as communication layer and part of process management. One of this framework's features is t

Re: [OMPI users] Checkpoint an MPI process

2012-01-19 Thread Lloyd Brown
Since you're looking for a function call, I'm going to assume that you are writing this application, and it's not a pre-compiled, commercial application. Given that, it's going to be significantly better to have an internal application checkpointing mechanism, where it serializes and stores the da

Re: [OMPI users] Checkpoint an MPI process

2012-01-19 Thread Josh Hursey
Currently Open MPI only supports the checkpointing of the whole application. There has been some work on uncoordinated checkpointing with message logging, though I do not know the state of that work with regards to availability. That work has been undertaken by the University of Tennessee Knoxville

[OMPI users] Checkpoint an MPI process

2012-01-18 Thread Rodrigo Oliveira
Hi, I'd like to know if there is a way to checkpoint a specific process running under an mpirun call. In other words, is there a function CHECKPOINT(rank) in which I can pass the rank of the process I want to checkpoint? I do not want to checkpoint the entire application, but just one of its proce