Steve,
I suspect you did not install the packages.
You need to install slurm-slurmctld to get the slurmctld systemd files:
/# rpm -qlp slurm-slurmctld-20.11.9-1.el7.x86_64.rpm//
///run/slurm/slurmctld.pid//
/*//usr/lib/systemd/system/slurmctld.service/*/
///usr/sbin/slurmctld//
///usr/share/man/man8/slurmctld.8.gz//
/
The same for slurm-slurmdbd. Both of those are management daemons and
should only be running on one (two if you configure failover) systems.
Your compute nodes need slurm-slurmd, which will provide the systemd
files for slurmd.
On 11/27/2022 7:34 PM, 刘 博涵 wrote:
Hi all,
I'm a newcomer to cluster computing and have been trying to setup a
Slurm cluster myself. Right now I'm stuck at starting up Slurm's
systemd services. I checked out the following tutorials:
1. Slurm Workload Manager - Quick Start Administrator Guide
(schedmd.com) <https://slurm.schedmd.com/quickstart_admin.html>
2. https://wiki.fysik.dtu.dk/Niflheim_system/Slurm_installation/
3. https://wiki.bkslab.org/index.php/Slurm_Installation_Guide
4. Slurm installation (southgreenplatform.github.io)
<https://southgreenplatform.github.io/trainings/hpc/slurminstallation/>
All of them state that I should run /systemctl enable/start
slurmd/slurmdbd/slurmctld/ after installation, however they always
fail because the corresponding systemd config files do not exist,
regardless of whether I installed Slurm from source or from EPEL
repos. All my systems are CentOS 7.9 with the latest updates prior to
Slurm installation, and I was trying to install Slurm 22.05.6 from
source. My question is are the systemd config files actually created
during installation process as the tutorials imply, or do I have to
write them myself? If the latter, then how should I write my slurm
systemd config file (what parameters should I put in etc.), any
templates I can follow?
Many thanks,
Steve