Re: [OMPI users] How to build and use checkpoint/restart fault tolerance in Open MPI.

2007-08-22 Thread Josh Hursey
The problem is not with your setup, but with a limitation of the current checkpoint/restart implementation in Open MPI. Currently, Open MPI requires that the MPI process be inside the MPI library in order to make progress on a checkpoint request. This is because all checkpoint coordinatio

Re: [OMPI users] How to build and use checkpoint/restart fault tolerance in Open MPI.

2007-08-22 Thread Hiep Bui Hoang
Hi, I had compiled and installed Open MPI with C/R support in the way Josh said. When finished, Open MPI had support and tools for C/R: ompi-checkpoint, ompi-restart. And I try an example ( hello_c.c in examples folder, but I edit it with a for loop to print out "Hello..." 1,000,000 times) But I ge

Re: [OMPI users] How to build and use checkpoint/restart fault tolerance in Open MPI.

2007-08-21 Thread Josh Hursey
I just updated the Users Guide on the wiki. I added the following sections: - Compiling and Building - Example Checkpoint/Restart Use Cases Let me know if you have any trouble with this work. Cheers, Josh On Aug 21, 2007, at 1:46 PM, Josh Hursey wrote: Hello, There are a few things you n

Re: [OMPI users] How to build and use checkpoint/restart fault tolerance in Open MPI.

2007-08-21 Thread Josh Hursey
Hello, There are a few things you need to do to build Open MPI with Checkpoint/Restart support. By default Open MPI is configured without checkpoint/restart support. 1) Make sure you have BLCR successfully installed and loaded on your system(s) 2) configure Open MPI with the "--with-ft=cr"

[OMPI users] How to build and use checkpoint/restart fault tolerance in Open MPI.

2007-08-21 Thread Hiep Bui Hoang
Hello, I'm Hiep, I'm trying to use checkpoint/restart feature in Open MPI. I had read information about this feature in https://svn.open-mpi.org/trac/ompi/wiki/ProcessFT_CR and Open-MPI-FT-CR-Draft-v1.pdf. I had built Open MPI from "trunk" which gotten by Subversion. But I don't know how to enable