Fixed this, not all paths were in variables. Sorry. > Hello. I've got following problem. I've run MPI programm and successful > checkpointed it with BLCR. > But now, when I'm trying to restart it using ompi-restart -v > ompi_global_snapshot_7190.ckpt I'm getting following message:
> [umu2:07572] Checking for the existence of > (/root/ompi_global_snapshot_7190.ckpt) > [umu2:07572] Restarting from file (ompi_global_snapshot_7190.ckpt) > [umu2:07572] Exec in self > -------------------------------------------------------------------------- > Error: Unable to obtain the proper restart command to restart from the > checkpoint file (ompi_global_snapshot_7190.ckpt). Returned -1. > -------------------------------------------------------------------------- > Both Open-MPI and BLCR are installed into shared NFS directory, blcr > directories are included into PATH and LD_LIBRARY_PATH variables on > restart node. > Open MPI initially configured with keys > −−with−ft=cr −−enable−ft−thread −−enable−mpi−thread > −−with−blcr=/path/to/blcr > Program was run with -am ft-enable-cr. > What can cause such problem? > -------------------------------------------- > With best regards > Gleb "Crazy Sage" Igumnov > mailto:crazy.s...@gmail.com -- With best regards, Gleb "Crazy Sage" Igumnov mailto:crazy.s...@gmail.com