On 8/14/20 6:17 am, Stefan Staeglich wrote:
> what's the current status of the checkpointing support in SLURM?
There isn't any these days. There used to be support for BLCR, but that's been dropped as BLCR is no more.
I know from talking with SchedMD that they are of the opinion that any current checkpoint/resume code (such as DMTCP [1]) should be supported via the user's batch script and not in Slurm itself.
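As a rough illustration of what handling this in the batch script could look like, here is a minimal sketch of the launch-or-restart decision a job might make on each (re)submission. It assumes DMTCP's `dmtcp_launch` and `dmtcp_restart` commands are on the PATH and that checkpoint images follow DMTCP's usual `ckpt_*.dmtcp` naming. The function name `build_command`, the directory layout, and the default interval are hypothetical choices for illustration, not anything prescribed by SchedMD or DMTCP.

```python
import glob
import os


def build_command(ckpt_dir, app_cmd, interval_s=3600):
    """Return the argv a batch job could run (hypothetical helper).

    If checkpoint images already exist in ckpt_dir, resume from them;
    otherwise start the application under DMTCP with periodic checkpoints.
    """
    # Assumption: images are named ckpt_*.dmtcp and live directly in ckpt_dir.
    pattern = os.path.join(glob.escape(ckpt_dir), "ckpt_*.dmtcp")
    images = sorted(glob.glob(pattern))
    if images:
        # Resume: hand every image found to dmtcp_restart.
        return ["dmtcp_restart"] + images
    # Fresh start: checkpoint into ckpt_dir every interval_s seconds.
    return [
        "dmtcp_launch",
        "--ckptdir", ckpt_dir,
        "--interval", str(interval_s),
    ] + list(app_cmd)
```

A batch script could then pass the returned list to `subprocess.call` and resubmit itself when the job hits its time limit. Stale images from an unrelated earlier run would be picked up too, so a real script would likely want a per-job checkpoint directory.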
All the best,
Chris

[1] - https://github.com/dmtcp/dmtcp

--
Chris Samuel : http://www.csamuel.org/ : Berkeley, CA, USA