[slurm-users] Jobs distribution over CPUs

2024-08-09 Thread Rafał Lalik via slurm-users
Hi, I have a very simple computing farm on a single PC with an AMD Ryzen 7950X (16 cores x 2 threads). I have configured Slurm to use up to 25 CPUs: NodeName=palmer CPUs=25 RealMemory=4 State=UNKNOWN # Boards=1 SocketsPerBoard=1 CoresPerSocket=16 ThreadsPerCore=2 PartitionName=main Nodes=ALL Default
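
For orientation, the topology fields in the commented-out part of that node line multiply out to the number of hardware threads on the machine, which can be compared with the configured CPUs=25. The helper below is a minimal sketch of that arithmetic only; `logical_cpus` is a hypothetical name, not a Slurm tool, and it makes no claim about how Slurm itself validates or schedules against these values.

```python
def logical_cpus(node_line: str) -> int:
    """Multiply the topology fields found in a slurm.conf-style node line.

    Hypothetical helper for illustration; fields that are absent count as 1.
    """
    # Split "Key=Value" tokens into a dictionary, ignoring bare words and "#".
    fields = dict(tok.split("=", 1) for tok in node_line.split() if "=" in tok)
    total = 1
    for key in ("Boards", "SocketsPerBoard", "CoresPerSocket", "ThreadsPerCore"):
        total *= int(fields.get(key, 1))
    return total


# Topology values copied from the message above.
topology = "Boards=1 SocketsPerBoard=1 CoresPerSocket=16 ThreadsPerCore=2"
print(logical_cpus(topology))  # 1 * 1 * 16 * 2 = 32
```

With these values the machine has 32 hardware threads, so CPUs=25 exposes only part of them.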

[slurm-users] ODP: Re: _refresh_assoc_mgr_qos_list: no new list given back keeping cached one

2024-08-05 Thread Rafał Lalik via slurm-users
I had the same issue. After upgrading to slurm-24.05.2 the problem was solved. Try it. R. From: andreas.wiedholz--- via slurm-users Sent: Monday, 15 July 2024 14:32 To: slurm-users@lists.schedmd.com Subject: [slurm-users] Re: _refresh_assoc_mgr_qos_list: no new

[slurm-users] Re: Issue with starting slurmctld

2024-06-17 Thread Rafał Lalik via slurm-users
Recent compiler-hardening efforts broke Slurm's way of loading plugins. As a workaround, link Slurm with -Wl,-z,lazy -- slurm-users mailing list -- slurm-users@lists.schedmd.com To unsubscribe send an email to slurm-users-le...@lists.schedmd.com Thanks, this fixed the issue for me. Regards, Rafał

[slurm-users] Ill-formed config from the online configurator

2024-06-17 Thread Rafał Lalik via slurm-users
When JobAcctGatherType is set to anything other than none, the online configurator generates a config file with an entry such as: #JobAcctGatherTypejobacct_gather/cgroup= or #JobAcctGatherTypejobacct_gather/linux= Regards, Rafał
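
To make the defect concrete: in both entries the `=` has ended up after the value instead of between key and value, and the line is commented out. The sketch below repairs such a line, assuming the intended output is an active `JobAcctGatherType=<value>` entry; that intent is an assumption, since the report does not spell out the expected form. `repair_jobacct_line` is a hypothetical helper, not part of Slurm or the configurator.

```python
import re

# Matches the malformed form from the report, e.g.
# "#JobAcctGatherTypejobacct_gather/cgroup=" (key and value fused, "=" trailing).
_BROKEN = re.compile(r"^#?JobAcctGatherType(jobacct_gather/\w+)=\s*$")


def repair_jobacct_line(line: str) -> str:
    """Return the line in Key=value form; leave any other line untouched.

    Assumption: the configurator meant to emit an uncommented entry.
    """
    match = _BROKEN.match(line.strip())
    if match is None:
        return line
    return f"JobAcctGatherType={match.group(1)}"


for raw in ("#JobAcctGatherTypejobacct_gather/cgroup=",
            "#JobAcctGatherTypejobacct_gather/linux="):
    print(repair_jobacct_line(raw))
```

Lines that do not match the malformed pattern pass through unchanged, so the function can be applied to every line of a generated file.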

[slurm-users] cgroup issue on non-systemd system

2024-06-14 Thread Rafał Lalik via slurm-users
Hello, per the documentation, it is possible to run Slurm on a non-systemd system with IgnoreSystemd=yes in cgroup.conf. However, I got an error from slurmd: error: common_file_write_content: unable to open '/sys/fs/cgroup/system.slice/cgroup.subtree_control' for writing: No such file or directory

[slurm-users] Issue with starting slurmctld

2024-06-14 Thread Rafał Lalik via slurm-users
Hello, I have encountered issues running slurmctld. In the logs, I see these errors: [2024-06-14T17:37:57.587] slurmctld version 24.05.0 started on cluster laura [2024-06-14T17:37:57.587] error: plugin_load_from_file: dlopen(/usr/lib64/slurm/jobacct_gather_cgroup.so): /usr/lib64/slurm/jobac