On Tue, Mar 23, 2010 at 1:25 PM, fengguang tian wrote:
> now, I set $HOME as shared directory, but when doing ompi-checkpoint, it
> shows:(nimbus1 is the remote machine in
> my cluster)
>
> [nimbus1:12630] opal_os_dirpath_create: Error: Unable to create the
> sub-directory (/home/mpiu/ompi_global_
now, I set $HOME as shared directory, but when doing ompi-checkpoint, it
shows:(nimbus1 is the remote machine in
my cluster)
[nimbus1:12630] opal_os_dirpath_create: Error: Unable to create the
sub-directory (/home/mpiu/ompi_global_snapshot_1662.ckpt/0) of
(/home/mpiu/ompi_global_snapshot_1662.ckpt
On Tue, Mar 23, 2010 at 12:24 PM, fengguang tian wrote:
> Hi
>
> I am using open-mpi and blcr in a cluster of 3 machines, and the checkpoint
> and restart work fine in single machine,but when doing checkpoint in
> clusters environment, the ompi-checkpoint hangs
Besdies what has been said in anoth