[slurm-users] cgroups/v2 plugin rpmbuild issue

2024-07-05 Thread Chris Taylor via slurm-users
Trying to use rpmbuild on Rocky9 Linux, Slurm 21.08 - I want to build with cgroups/v2 support and have these installed: libbpf-devel.x86_642:1.3.0-2.el9 dbus-devel.x86_64 1:1.12.20-8.el9 kernel-headers.x86_64 5.14.0

[slurm-users] Run a program via Strigger when a node joins the cluster

2024-07-05 Thread Karri Vrkreddy via slurm-users
Hi,   We have a requirement to run a specific program whenever any new node joins the slurm cluster.  For this, we have tried using strigger with the following options :   "strigger --set --node --up --flags=perm --program="   We see that the trigger is not getting activated always. Checked slu

[slurm-users] Re: Using sharding

2024-07-05 Thread Reed Dier via slurm-users
I would try specifying cpus and mem just to be sure its not requesting 0/all. Also, I was running into a weird issue when I had oversubscribe=yes:2 causing odd issues in my lab cluster when playing with shards, where they would go pending resources despite no alloc of gpu/shards. Once I reverted

[slurm-users] Re: Using sharding

2024-07-05 Thread Ward Poelmans via slurm-users
Hi Arnuld, On 5/07/2024 13:56, Arnuld via slurm-users wrote: It should show up like this:     Gres=gpu:gtx_1080_ti:4(S:0-1),shard:gtx_1080_ti:16(S:0-1) What's the meaning of (S:0-1) here? The sockets to which the GPUs are associated: If GRES are associated with specific sockets, t

[slurm-users] Re: Using sharding

2024-07-05 Thread Arnuld via slurm-users
> On Fri, Jul 5, 2024 at 12:19 PM Ward Poelmans > via slurm-users wrote: > Hi Ricardo, > > It should show up like this: > > Gres=gpu:gtx_1080_ti:4(S:0-1),shard:gtx_1080_ti:16(S:0-1) > What's the meaning of (S:0-1) here? -- slurm-users mailing list -- slurm-users@lists.schedmd.com To unsub