The problem is not with your setup, but with a limitation of the
current checkpoint/restart implementation in Open MPI. At present,
an MPI process must be inside the MPI library for Open MPI to make
progress on a checkpoint request. This is because all
checkpoint coordinatio
Hi,
I compiled and installed Open MPI with C/R support the way Josh described.
Once the build finished, Open MPI had the C/R support and tools:
ompi-checkpoint and ompi-restart.
I then tried an example (hello_c.c from the examples folder, edited to add a
for loop that prints "Hello..." 1,000,000 times).
But I ge
I just updated the Users Guide on the wiki. I added the following
sections:
- Compiling and Building
- Example Checkpoint/Restart Use Cases
Let me know if you have any trouble with this work.
Cheers,
Josh
On Aug 21, 2007, at 1:46 PM, Josh Hursey wrote:
Hello,
There are a few things you n
Hello,
There are a few things you need to do to build Open MPI with
Checkpoint/Restart support. By default Open MPI is configured without
checkpoint/restart support.
1) Make sure you have BLCR successfully installed and loaded on your
system(s)
2) configure Open MPI with the "--with-ft=cr"
Hello,
I'm Hiep, and I'm trying to use the checkpoint/restart feature in Open MPI.
I have read about this feature at
https://svn.open-mpi.org/trac/ompi/wiki/ProcessFT_CR
and in Open-MPI-FT-CR-Draft-v1.pdf. I built Open MPI from the "trunk", which
I checked out with Subversion.
But I don't know how to enable