"gzzh...@buaa.edu.cn" <gzzh...@buaa.edu.cn> writes: > Hi Team > I am trying to use the MPI to do some test and study on the C/R > enabled debugging. Professor Josh Hursey said that the feature never > made it into a release so it was only ever available on the trunk, > However , since that time the C/R functionality has fallen into > disrepair. It is most likely broken in the trunk today. T tried with > the current openmpi-master sourcecode, it can be configure, but can't > be make successful because bugs still existing according to the log. > Is there any possible that the history openmpi-developer code which > supports C/R enabled debugging can be download . I appreciate your > offer to help us .
This does seem an important deficiency, and a good reason to stay with 1.6 or use mpich. However, DMTCP is supposed to be able to checkpoint over TCP and Infiniband without any extra support. I'm intending to try it soon and would be interested to know any relevant experience. There used to be a note about not working over IB with some OMPI implementation detail (URC?) but I can't find that now, and the web site implies it should work. See http://dmtcp.sourceforge.net/