Dear all,
I have figured it out. It was a simple issue: I hadn't added the "blcr lib" to
the $PATH environment variable. Even so, the checkpoint operation still worked;
only the restart operation failed. It was so weird.
Best regards
Xianjun Meng
On December 23, 2010 at 5:35 PM, 孟宪军 wrote:
My main question is:
After I checkpointed a simple task that was running on
two machines, I could only restart it on one machine. If I run the following
command to force ompi-restart to run the program on two machines:
*ompi-restart -hostfile ./machine_names ompi_global