On 8/14/20 6:17 am, Stefan Staeglich wrote:

what's the current status of the checkpointing support in SLURM?

There isn't any these days, there used to be support for BLCR but that's been dropped as BLCR is no more.

I know from talking with SchedMD they are of the opinion that any current checkpoint/resume code (such as DMTCP [1]) should be supported via the users batch script and not in Slurm itself.

All the best,
Chris

[1] - https://github.com/dmtcp/dmtcp

--
  Chris Samuel  :  http://www.csamuel.org/  :  Berkeley, CA, USA

Reply via email to