On 5/29/22 3:09 pm, byron wrote:
This is the first time I've done an upgrade of slurm and I had been
hoping to do a rolling upgrade as opposed to waiting for all the jobs to
finish on all the compute nodes and then switching across but I dont see
how I can do it with this setup. Does any one have any expereience of this?
We do rolling upgrades with:
scontrol reboot ASAP nextstate=resume reason="some-useful-reason"
[list-of-nodes]
But you do need to have RebootProgram defined and an appropriate
ResumeTimeout set to allow enough time for your node to reboot (and of
course your system must be configured to boot into a production ready
state when rebooted, including starting up slurmd).
All the best,
Chris
--
Chris Samuel : http://www.csamuel.org/ : Berkeley, CA, USA