Re: [OMPI users] Openmpi Checkpoint/Restart failed

2010-12-23 Thread 孟宪军
Dear all, I have figured it out. It was a simple issue, I didn't add the "blcr lib" to the $PATH environment varable. However, it can make checkpoint operation, but can't make restart operation successfully. It was so wield. Best regards Xianjun Meng 在 2010年12月23日 下午5:35,孟宪军 写道: > My main ques

Re: [OMPI users] Openmpi Checkpoint/Restart failed

2010-12-23 Thread 孟宪军
My main question is: after I finished the checkpoint operation against a simple task which ran on tow machines, I can only restart it on one machine. if I ran the following command to force the ompi-restart to run the program on two machines: *ompi-restart -hostfile ./machine_names ompi_global

[OMPI users] Openmpi Checkpoint/Restart failed

2010-12-23 Thread 孟宪军
Dear all, I had to try the checkpoint/restart function of Openmpi recently, and after several failure and checking lots of the docement, I am still very confused about how to config the checkpoint/restart function. Can anybody give me a $HOME/.openmpi/mca-params.conf script and introduce me what p