Hi,

I have installed openmpi-1.3a1r18651 and tried to checkpoint an MPI
application.

raj@portal018:~/examples> mpirun  -np 1 -am ft-enable-cr ./myapp.sh &

raj@portal018:~/examples> ompi-checkpoint --term 30416


However, when I try to restart from the checkpoint file, I get the following message:


raj@portal018:~> ompi-restart -v -machinefile portal018 ompi_global_snapshot_30416.ckpt
[portal018:20178] Checking for the existence of (/home/raj/ompi_global_snapshot_30416.ckpt)
[portal018:20178] Restarting from file (ompi_global_snapshot_30416.ckpt)
[portal018:20178]        Exec in self
--------------------------------------------------------------------------
mpirun could not find anything to do.
It is possible that you forgot to specify how many processes to run
via the "-np" argument.
--------------------------------------------------------------------------


Any help would be greatly appreciated.

Regards,

Raj


