Hi Daniel,

>  error: Unable to contact slurm controller (connect failure)
> 
> I appreciate any insight on what could be the cause.

Can you check that the slurmctld is up and running, and that the said
commands work on the controller machine itself?
If the slurmctld cannot be started as a service, try to run it in verbose
debug mode (-D -vvv) and find out what might be wrong with it.
If it runs in foreground, check the systemd service again.
Proceed to compute nodes only when you are sure that the ctld is OK.
(IIRC there was a flag in the systemd service definition that had to be
adjusted after an upgrade, maybe you're hitting the same?)

Best,
 Steffen

-- 
Steffen Grunewald, Cluster Administrator
Max Planck Institute for Gravitational Physics (Albert Einstein Institute)
Am Mühlenberg 1 * D-14476 Potsdam-Golm * Germany
~~~
Fon: +49-331-567 7274
Mail: steffen.grunewald(at)aei.mpg.de
~~~

-- 
slurm-users mailing list -- slurm-users@lists.schedmd.com
To unsubscribe send an email to slurm-users-le...@lists.schedmd.com

Reply via email to