Re: [OMPI users] ompi-checkpoint hangs when using in multiple clusters

2010-03-23 Thread Fernando Lemos
On Tue, Mar 23, 2010 at 1:25 PM, fengguang tian wrote:
> now, I set $HOME as shared directory, but when doing ompi-checkpoint, it
> shows:(nimbus1 is the remote machine in
> my cluster)
>
> [nimbus1:12630] opal_os_dirpath_create: Error: Unable to create the
> sub-directory (/home/mpiu/ompi_global_

Re: [OMPI users] ompi-checkpoint hangs when using in multiple clusters

2010-03-23 Thread fengguang tian
Now I have set $HOME as a shared directory, but when running ompi-checkpoint, it shows the following (nimbus1 is the remote machine in my cluster):

[nimbus1:12630] opal_os_dirpath_create: Error: Unable to create the sub-directory (/home/mpiu/ompi_global_snapshot_1662.ckpt/0) of (/home/mpiu/ompi_global_snapshot_1662.ckpt

Re: [OMPI users] ompi-checkpoint hangs when using in multiple clusters

2010-03-23 Thread Fernando Lemos
On Tue, Mar 23, 2010 at 12:24 PM, fengguang tian wrote:
> Hi
>
> I am using open-mpi and blcr in a cluster of 3 machines, and the checkpoint
> and restart work fine in single machine,but when doing checkpoint in
> clusters environment, the ompi-checkpoint hangs

Besides what has been said in anoth