Hi, please share the output of one of the following:
cat /etc/redhat-release
OR
cat /etc/lsb-release
Also, please share the detailed logs, which are probably
available at /var/log/slurm/slurmctld.log
and the output of:
ps -ef | grep slurmctld
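If it is easier, the checks above could be collected in one go with something like the rough sketch below. The function names (first_readable, collect_diagnostics) and the SLURMCTLD_LOG variable are made up for illustration, /etc/os-release is an extra fallback I am assuming may exist on your distribution, and the log path is only the usual default, so please adjust it to your setup.

```shell
#!/bin/sh
# Hypothetical helper: print the first readable file among the given paths.
first_readable() {
    for f in "$@"; do
        if [ -r "$f" ]; then
            cat "$f"
            return 0
        fi
    done
    return 1
}

# Hypothetical wrapper around the checks requested in this mail.
collect_diagnostics() {
    echo "== OS release =="
    # /etc/os-release is an assumed extra fallback, not one of the files asked for above.
    first_readable /etc/redhat-release /etc/lsb-release /etc/os-release \
        || echo "no release file found"

    echo "== slurmctld process =="
    # The [s] bracket keeps grep from matching its own command line.
    ps -ef | grep '[s]lurmctld' || echo "slurmctld does not appear to be running"

    echo "== last lines of the controller log =="
    # Assumed default location; override with SLURMCTLD_LOG if yours differs.
    log=${SLURMCTLD_LOG:-/var/log/slurm/slurmctld.log}
    if [ -r "$log" ]; then
        tail -n 50 "$log"
    else
        echo "cannot read $log"
    fi
}
```

Calling collect_diagnostics (as root, if the log is not world-readable) and pasting the output in your reply should cover everything asked for above.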
Thanks & Regards,
Sudeep Narayan Banerjee
System Analyst | Scientist B
Information System Technology Facility
Academic Block 5 | Room 110
Indian Institute of Technology Gandhinagar
Palaj, Gujarat 382355 INDIA
On 11/06/20 5:54 pm, navin srivastava wrote:
Hi Team,
When I try to start the slurmd process, I get the error below.
2020-06-11T13:11:58.652711+02:00 oled3 systemd[1]: Starting Slurm node daemon...
2020-06-11T13:13:28.683840+02:00 oled3 systemd[1]: slurmd.service: Start operation timed out. Terminating.
2020-06-11T13:13:28.684479+02:00 oled3 systemd[1]: Failed to start Slurm node daemon.
2020-06-11T13:13:28.684759+02:00 oled3 systemd[1]: slurmd.service: Unit entered failed state.
2020-06-11T13:13:28.684917+02:00 oled3 systemd[1]: slurmd.service: Failed with result 'timeout'.
2020-06-11T13:15:01.437172+02:00 oled3 cron[8094]: pam_unix(crond:session): session opened for user root by (uid=0)
Slurm version is 17.11.8
The server and Slurm have been running for a long time and we have not made
any changes, but today when I start it, it gives this error message.
Any idea what could be wrong here?
Regards
Navin.