Re: [OMPI users] OpenMPI Checkpoint/Restart is failed

2010-05-24 Thread Nguyen Toan
Does anyone have any idea about this? Thank you! Regards, Nguyen Toan On Mon, May 24, 2010 at 4:08 PM, Hideyuki Jitsumoto <jitum...@gsic.titech.ac.jp> wrote: > -- Forwarded message -- > From: Fernando Lemos > Date: Thu, Apr 15, 2

Re: [OMPI users] OpenMPI Checkpoint/Restart is failed

2010-05-18 Thread Hideyuki Jitsumoto
Hi Josh, Thank you for your reply. I applied the patch from Ticket #2139 to openmpi-1.4.1 and reinstalled all of the components from scratch. After that, it worked correctly. There was probably some fault in how I prepared my environment. # I cannot reproduce the environment in which the failure occurred. # I'

Re: [OMPI users] OpenMPI Checkpoint/Restart is failed

2010-05-18 Thread Josh Hursey
(Sorry for the delay in replying, more below) On Apr 12, 2010, at 6:36 AM, Hideyuki Jitsumoto wrote: Hi Members, I tried to use checkpoint/restart with Open MPI, but I cannot get correct checkpoint data. I prepared the execution environment as follows; the strings in () give the name of the output file whic

Re: [OMPI users] OpenMPI Checkpoint/Restart is failed

2010-04-14 Thread Fernando Lemos
On Wed, Apr 14, 2010 at 5:25 AM, Hideyuki Jitsumoto wrote: > Fernando, > > Thank you for your reply. > I tried to patch the file you mentioned, but the output did not change. I didn't test the patch, tbh. I'm using 1.5 nightly snapshots, and it works great. >>Are you using a shared file system?

Re: [OMPI users] OpenMPI Checkpoint/Restart is failed

2010-04-14 Thread Hideyuki Jitsumoto
Fernando, Thank you for your reply. I tried to patch the file you mentioned, but the output did not change. >Are you using a shared file system? You need to use a shared file system for checkpointing with 1.4.1: What do you mean by a shared file system? Do you mean NFS, Lustre, and so on? (I'm sorry about

Re: [OMPI users] OpenMPI Checkpoint/Restart is failed

2010-04-12 Thread Fernando Lemos
On Mon, Apr 12, 2010 at 7:36 AM, Hideyuki Jitsumoto wrote: > Hi Members, > > I tried to use checkpoint/restart with Open MPI. > But I cannot get correct checkpoint data. > I prepared the execution environment as follows; the strings in () give the > name of the output file which is attached to the next e-mail ( for ma

Re: [OMPI users] OpenMPI Checkpoint/Restart is failed

2010-04-12 Thread Hideyuki Jitsumoto
I am resending this mail because of a sending error (I used the wrong email address in the From field). Sorry if you receive multiple copies of this email. I attach file (2/2) to this email, as mentioned in the previous one. Thank you, Hideyuki openmpi_others_log.tar.gz Description: GNU Zip compressed data

Re: [OMPI users] OpenMPI Checkpoint/Restart is failed

2010-04-12 Thread Hideyuki Jitsumoto
I am resending this mail because of a sending error (I used the wrong email address in the From field). Sorry if you receive multiple copies of this email. I attach file (1/2) to this email, as mentioned in the previous one. openmpi_config_log.tar.gz Description: GNU Zip compressed data

Re: [OMPI users] OpenMPI Checkpoint/Restart is failed

2010-04-12 Thread Hideyuki Jitsumoto
I attach file (2/2) to this email, as mentioned in the previous one. Thank you, Hideyuki * ** ** ** WARNING: This email contains an attachment of a very

Re: [OMPI users] OpenMPI Checkpoint/Restart is failed

2010-04-12 Thread Hideyuki Jitsumoto
I attach file (1/2) to this email, as mentioned in the previous one. I'm very sorry for sending such a large log file. Thank you, Hideyuki * ** ** ** WARNING: