Re: [OMPI users] Open MPI Checkpoint Restart

2013-06-04 Thread Neel Sunil Desai
Hi, So, I was able to remove the "cannot open shared file or object" errors. But I am not able to checkpoint yet. When I enter ompi-checkpoint PID of mpirun, it does not return anything (not even a new prompt). In my mca-params.conf file, I added sstore=stage sstore_stage_local_snapshot_dir=/tm

Re: [OMPI users] Open MPI Checkpoint Restart

2013-06-03 Thread Neel Sunil Desai
Hi Ralph. I checked the errors. I do not understand what the following means: The session directory location could not be parsed. ompi-checkpoint attempted to use the session directory: /tmp/openmpi-sessions-ndesai@vcainternmpi01_0 I opened the /tmp/openmpi-sessions-ndesai direct

Re: [OMPI users] Open MPI Checkpoint Restart

2013-05-31 Thread Ralph Castain
Did you check the items on the list given in the error? I'm no expert on ompi-checkpoint, but the error means that one of those conditions isn't being met. On May 31, 2013, at 4:54 PM, Neel Sunil Desai wrote: > Hi Ralph, > > Thanks for the help. The path and ld_path were not set to the corr

Re: [OMPI users] Open MPI Checkpoint Restart

2013-05-31 Thread Neel Sunil Desai
Hi Ralph, Thanks for the help. The path and ld_path were not set to the correct location. I was able to execute the ompi-checkpoint command. But, I got the following error. [ndesai@vcainternmpi01 ~]$ ompi-checkpoint 1803 -- E

Re: [OMPI users] Open MPI Checkpoint Restart

2013-05-31 Thread George Bosilca
Take a look at config.log to see why the FT support has been turned off. Maybe the configure script failed to find BLCR? George. On Jun 1, 2013, at 01:31 , Neel Sunil Desai wrote: > Hi Ralph, > > I did install open mpi with the --with-ft=cr option. > > Thanks, > Neel. > > On Fri, May 31,
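One way to follow this suggestion is to pull out only the lines of config.log that mention BLCR, since the full log is very long. A rough sketch, assuming a POSIX shell and that the log sits in the directory where configure was run; blcr_lines is a made-up helper name, not an Open MPI tool:

```shell
# blcr_lines FILE: list the lines of a configure log that mention BLCR,
# prefixed with their line numbers.
# (blcr_lines is a hypothetical helper name, not part of Open MPI.)
blcr_lines() {
    grep -n -i 'blcr' "$1"
}

# Example, run from the Open MPI build directory:
#   blcr_lines config.log
```

If nothing is printed, configure probably never looked for BLCR at all, which would point back at the options passed to configure.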

Re: [OMPI users] Open MPI Checkpoint Restart

2013-05-31 Thread Ralph Castain
Check that your path and ld_library_path are set to point to the directory where you installed the version you built (the --prefix=<> you provided). On May 31, 2013, at 4:31 PM, Neel Sunil Desai wrote: > Hi Ralph, > > I did install open mpi with the --with-ft=cr option. > > Thanks, > Neel.
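A minimal sketch of that check, assuming a POSIX shell and an install that puts its binaries under bin and its libraries under lib (some systems use lib64 instead). The function name and the example directory are placeholders, not values from this thread:

```shell
# use_ompi_prefix DIR: put DIR/bin and DIR/lib first on the search paths.
# DIR stands for whatever was passed to --prefix at configure time.
# (use_ompi_prefix is a hypothetical helper name.)
use_ompi_prefix() {
    PATH="$1/bin:$PATH"
    # Append the old value only if it was non-empty, to avoid a trailing colon.
    LD_LIBRARY_PATH="$1/lib${LD_LIBRARY_PATH:+:$LD_LIBRARY_PATH}"
    export PATH LD_LIBRARY_PATH
}

# Example (the directory is a placeholder):
#   use_ompi_prefix "$HOME/openmpi-install"
```

Putting the new directories first matters when an older system-wide Open MPI is also installed, because the shell and the dynamic loader take the first match they find.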

Re: [OMPI users] Open MPI Checkpoint Restart

2013-05-31 Thread Neel Sunil Desai
Hi Ralph, I did install Open MPI with the --with-ft=cr option. Thanks, Neel. On Fri, May 31, 2013 at 4:25 PM, Ralph Castain wrote: > Okay, it should work in that version. It sounds like you didn't configure > OMPI with the --with-ft=cr option - yes? Take a look at "./configure -h" > for the ft

Re: [OMPI users] Open MPI Checkpoint Restart

2013-05-31 Thread Ralph Castain
Okay, it should work in that version. It sounds like you didn't configure OMPI with the --with-ft=cr option - yes? Take a look at "./configure -h" for the ft-related options and ensure you build what you need. C/R support is not built by default. On May 31, 2013, at 3:59 PM, Neel Sunil Desai

Re: [OMPI users] Open MPI Checkpoint Restart

2013-05-31 Thread Neel Sunil Desai
Open MPI 1.5.4 On Fri, May 31, 2013 at 3:31 PM, Ralph Castain wrote: > What OMPI version? > > On May 31, 2013, at 3:17 PM, Neel Sunil Desai > wrote: > > > Hi, > > > > I forgot to add. I watched the video of Joshua Hursey and when I type > ompi_info | grep FT, I get FT Checkpoint Support: no ( c

Re: [OMPI users] Open MPI Checkpoint Restart

2013-05-31 Thread Ralph Castain
What OMPI version? On May 31, 2013, at 3:17 PM, Neel Sunil Desai wrote: > Hi, > > I forgot to add. I watched the video of Joshua Hursey and when I type > ompi_info | grep FT, I get FT Checkpoint Support: no ( checkpoint thread : > no). I do not get anything when I type ompi_info | grep crs. >

Re: [OMPI users] Open MPI Checkpoint Restart

2013-05-31 Thread Neel Sunil Desai
Hi, I forgot to add. I watched the video of Joshua Hursey and when I type ompi_info | grep FT, I get FT Checkpoint Support: no ( checkpoint thread : no). I do not get anything when I type ompi_info | grep crs. Thanks, Neel.
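The same check can be scripted so that it gives a yes/no answer. A small sketch, assuming a POSIX shell; ft_support_enabled is a made-up name, and the match is case-insensitive because the exact capitalization and spacing of the label in real ompi_info output may differ from what was typed above:

```shell
# ft_support_enabled: read `ompi_info` output on stdin and succeed only if
# the "FT Checkpoint support" line reports "yes".
# (ft_support_enabled is a hypothetical helper, not part of Open MPI.)
ft_support_enabled() {
    grep -i 'FT Checkpoint support' | grep -qi 'support *: *yes'
}

# Example:
#   ompi_info | ft_support_enabled && echo "C/R support is built in"
```

A "no" here means the ompi_info being run belongs to a build without checkpoint/restart support, whether or not another build on the machine has it.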

[OMPI users] Open MPI Checkpoint Restart

2013-05-31 Thread Neel Sunil Desai
Hi, I installed BLCR 0.8.5 on my Red Hat Linux system. I can run the MPI program with the ft-enable-cr option. But when I try to take a checkpoint, I get an error: bash: ompi-checkpoint: command not found. What should I do? Please help. Thanks, Neel.
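For anyone hitting the same message: "command not found" comes from the shell, which could not find the tool on PATH, so a first step is to see where (or whether) the name resolves. A minimal sketch, assuming a POSIX shell; report_tool is a made-up helper name:

```shell
# report_tool NAME: print where NAME resolves on PATH, or say it is missing.
# (report_tool is a hypothetical helper, not part of Open MPI.)
report_tool() {
    if tool_path=$(command -v "$1" 2>/dev/null); then
        printf '%s: %s\n' "$1" "$tool_path"
    else
        printf '%s: not found on PATH\n' "$1"
        return 1
    fi
}

# Example:
#   report_tool mpirun
#   report_tool ompi-checkpoint
```

If mpirun resolves but ompi-checkpoint does not, the mpirun being used may come from a different installation than the one built with checkpoint support.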