Re: [OMPI users] OpenMPI + BLCR + LSF integration

2013-03-31 Thread Ralph Castain
Ah, now that's a slightly different failure mode from your original description. If it works without CR enabled, then the launcher is working just fine. The problem is in the checkpoint/restart integration. There are some things that get initialized differently under CR, but I have no idea what t

Re: [OMPI users] OpenMPI + BLCR + LSF integration

2013-03-30 Thread Ralph Castain
It is quite likely that the LSF integration in the 1.6 series is broken. We no longer have a way to test it (all our LSF access is gone). I was recently given brief access to an LSF machine and fixed it for the 1.7 series, but that series doesn't support checkpoint/restart. On Mar 30, 2