Dear Slurm :
>
> I am a HPC Specialist in Dell EMC China. I am asked by HPC China 2018 to
> organize a Forum named “the 3rd HPC Open Source Software Forum” in HPC China
> 2018 to be held in Qing Dao City , Shandong Province,
> http://hpcchina2018.csp.escience.cn/dct/page/70033 . Last year, I
Thank you very much. It seems that there was an unknown control character
in one of the config files which I couldn't see that in the editor.
Regards,
Mahmood
On Tue, Jul 31, 2018 at 10:22 PM, Hadrian Djohari wrote:
> Look at /var/log/slurm/slurmctld.log
>
>
Look at /var/log/slurm/slurmctld.log
On Tue, Jul 31, 2018 at 1:23 PM, Mahmood Naderan
wrote:
> I don't know what happened. It seems that it had been crashed before
>
> [root@rocks7 ~]# systemctl status slurmctld -l
> ● slurmctld.service - Slurm controller daemon
>Loaded: loaded (/usr/lib/sys
I don't know what happened. It seems that it had been crashed before
[root@rocks7 ~]# systemctl status slurmctld -l
● slurmctld.service - Slurm controller daemon
Loaded: loaded (/usr/lib/systemd/system/slurmctld.service; enabled;
vendor preset: disabled)
Active: failed (Result: exit-code) si
Seems like your slurmctld is not running. Have you checked its log to see
why?
On Tue, Jul 31, 2018 at 8:35 AM Mahmood Naderan
wrote:
> Hi,
> It seems that squeue is broken due to the following error:
>
> [root@rocks7 ~]# squeue
> slurm_load_jobs error: Unable to contact slurm controller (conne
Hi,
It seems that squeue is broken due to the following error:
[root@rocks7 ~]# squeue
slurm_load_jobs error: Unable to contact slurm controller (connect failure)
[root@rocks7 ~]# systemctl restart slurmd
[root@rocks7 ~]# systemctl restart slurmctld
[root@rocks7 ~]# squeue
slurm_load_jobs error: