Dear slurm users --
I'm new to slurm (somewhat experienced with Grid Engine, though that's
not relevant to this post). I have access to two slurm based clusters,
and have an application that (a) can be _very_long running (more than
8 weeks for one execution, though the compute and I/O demands of
DMTCP might be an option? Pretty sure there are RPMs for it in RHEL/CentOS 7.
Don’t recall it being any trouble to install.
http://dmtcp.sourceforge.net/
On Oct 4, 2019, at 9:47 PM, Eliot Moss
mailto:m...@cs.umass.edu>> wrote:
Dear slurm users --
I'm new to slurm (somewhat experienced with Gr