Hi,
I have a very simple computing farm on a single PC with AMD Ryzen 7950X (2x16
cores).
I have configured my slurm to use up to 25 CPUs:
NodeName=palmer CPUs=25 RealMemory=4 State=UNKNOWN # Boards=1
SocketsPerBoard=1 CoresPerSocket=16 ThreadsPerCore=2
PartitionName=main Nodes=ALL Default
I had the same issue. After upgrading to slurm-24.05.2 problem is solved. Try
it.
R.
Od: andreas.wiedholz--- via slurm-users
Wysłane: poniedziałek, 15 lipca 2024 14:32
Do: slurm-users@lists.schedmd.com
Temat: [slurm-users] Re: _refresh_assoc_mgr_qos_list: no new
Recent compiler-hardening efforts broke slurms way of loading plugins.
As a workaround, link slurm with -Wl,-z,lazy
--
slurm-users mailing list -- slurm-users@lists.schedmd.com
To unsubscribe send an email to slurm-users-le...@lists.schedmd.com
Thanks, this fixed issue for me.
Regards,
Rafał
The online configurator for JobAcctGatherType set to anything that none,
generates config file with a such entry:
#JobAcctGatherTypejobacct_gather/cgroup=
or
#JobAcctGatherTypejobacct_gather/linux=
Regards,
Rafał
--
slurm-users mailing list -- slurm-users@lists.schedmd.com
To unsubscribe sen
Hello,
per documentation, it is possible to run slurm on non systemd system with
IgnoreSystemd=yes in cgroup.conf.
However I had an error with slurmd:
error: common_file_write_content: unable to open
'/sys/fs/cgroup/system.slice/cgroup.subtree_control' for writing: No such file
or directory
Hello,
I have encountered issues with running slurmctld.
From logs, I see these errors:
[2024-06-14T17:37:57.587] slurmctld version 24.05.0 started on cluster laura
[2024-06-14T17:37:57.587] error: plugin_load_from_file: dlopen(/usr/lib64/slurm/jobacct_gather_cgroup.so):
/usr/lib64/slurm/jobac