[slurm-users] Help: Invitation for Presentation on SLURM in HPC Open Source Software Stack Forum in HPC China 2018 in Qingdao City, 20th Oct

2018-07-31 Thread 巍才凌
Dear Slurm : > > I am a HPC Specialist in Dell EMC China. I am asked by HPC China 2018 to > organize a Forum named “the 3rd HPC Open Source Software Forum” in HPC China > 2018 to be held in Qing Dao City , Shandong Province, > http://hpcchina2018.csp.escience.cn/dct/page/70033 . Last year, I

Re: [slurm-users] Unable to contact slurm controller

2018-07-31 Thread Mahmood Naderan
Thank you very much. It seems that there was an unknown control character in one of the config files which I couldn't see that in the editor. Regards, Mahmood On Tue, Jul 31, 2018 at 10:22 PM, Hadrian Djohari wrote: > Look at /var/log/slurm/slurmctld.log > >

Re: [slurm-users] Unable to contact slurm controller

2018-07-31 Thread Hadrian Djohari
Look at /var/log/slurm/slurmctld.log On Tue, Jul 31, 2018 at 1:23 PM, Mahmood Naderan wrote: > I don't know what happened. It seems that it had been crashed before > > [root@rocks7 ~]# systemctl status slurmctld -l > ● slurmctld.service - Slurm controller daemon >Loaded: loaded (/usr/lib/sys

Re: [slurm-users] Unable to contact slurm controller

2018-07-31 Thread Mahmood Naderan
I don't know what happened. It seems that it had been crashed before [root@rocks7 ~]# systemctl status slurmctld -l ● slurmctld.service - Slurm controller daemon Loaded: loaded (/usr/lib/systemd/system/slurmctld.service; enabled; vendor preset: disabled) Active: failed (Result: exit-code) si

Re: [slurm-users] Unable to contact slurm controller

2018-07-31 Thread Alex Chekholko
Seems like your slurmctld is not running. Have you checked its log to see why? On Tue, Jul 31, 2018 at 8:35 AM Mahmood Naderan wrote: > Hi, > It seems that squeue is broken due to the following error: > > [root@rocks7 ~]# squeue > slurm_load_jobs error: Unable to contact slurm controller (conne

[slurm-users] Unable to contact slurm controller

2018-07-31 Thread Mahmood Naderan
Hi, It seems that squeue is broken due to the following error: [root@rocks7 ~]# squeue slurm_load_jobs error: Unable to contact slurm controller (connect failure) [root@rocks7 ~]# systemctl restart slurmd [root@rocks7 ~]# systemctl restart slurmctld [root@rocks7 ~]# squeue slurm_load_jobs error: