The reason for restarting slurmctld before slurmd on the nodes is Moe
Jette's advice in
http://thread.gmane.org/gmane.comp.distributed.slurm.devel/3039
I would recommend:
1. Stop slurmctld
2. Update slurm.conf on all nodes
3. Restart slurmctld
4. Start slurmd on the new nodes
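A rough sketch of these four steps as a shell script might look like the following. It is not a tested procedure: the host names (head, node001, ...) and the slurm.conf path are hypothetical placeholders, and it assumes systemd units named slurmctld and slurmd, root ssh access, and that the edited slurm.conf sits on the machine running the script. By default it only prints the commands (DRY_RUN=1).

```shell
#!/bin/sh
# Sketch only: all host names and paths below are assumptions, not taken
# from any real cluster. DRY_RUN=1 (the default) prints commands instead
# of running them.
DRY_RUN="${DRY_RUN:-1}"
CTLD_HOST="${CTLD_HOST:-head}"                # hypothetical controller host
NEW_NODES="${NEW_NODES:-node101 node102}"     # hypothetical new nodes
ALL_NODES="${ALL_NODES:-node001 node002 $NEW_NODES}"
CONF="${CONF:-/etc/slurm/slurm.conf}"         # assumed config location

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = 1 ]; then
        echo "$*"
    else
        "$@"
    fi
}

add_nodes() {
    # 1. Stop slurmctld
    run ssh "$CTLD_HOST" systemctl stop slurmctld
    # 2. Update slurm.conf on all nodes
    for n in $ALL_NODES; do
        run scp "$CONF" "$n:$CONF"
    done
    # 3. Restart slurmctld (it was stopped in step 1, so start it)
    run ssh "$CTLD_HOST" systemctl start slurmctld
    # 4. Start slurmd on the new nodes only
    for n in $NEW_NODES; do
        run ssh "$n" systemctl start slurmd
    done
}

add_nodes
```

Check the printed commands first, then rerun with DRY_RUN=0 once they match your site. If you use pdsh/pdcp or a configuration management tool, the loops can be replaced accordingly.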
/Ole
On 10/23/2017 03:07 PM, Ole Holm Nielsen wrote:
Hi Jin,
I think that I always do your steps 3 and 4 in the opposite order:
restart slurmctld first, then slurmd on the nodes:
> 3. Restart the slurmd on all nodes
> 4. Restart the slurmctld
Since you run the very old Slurm 15.08, perhaps you should upgrade in
stages: 15.08 -> 16.05 -> 17.02. Version 17.11 will be released soon.
FYI: I wrote some notes about upgrading:
https://wiki.fysik.dtu.dk/niflheim/Slurm_installation#upgrading-slurm