[slurm-users] Slurm versions 20.11.2 is now available

2020-12-18 Thread Tim Wickberg
We are pleased to announce the availability of Slurm version 20.11.2. This resolves a critical regression from the recent 20.11.1 release which prevented both PMI and PMIx interfaces from functioning correctly. Slurm can be downloaded from https://www.schedmd.com/downloads.php . - Tim -- Tim

Re: [slurm-users] Slurm Upgrade Philosophy?

2020-12-18 Thread Alex Chekholko
Hi Jason, Ultimately each site decides how/why to do it; in my case I tend to do big "forklift upgrades", so I'm running 18.08 on the current cluster and will go to latest SLURM for my next cluster build. But you may have good reasons to upgrade slurm more often on your existing cluster. I don't

[slurm-users] Slurm Upgrade Philosophy?

2020-12-18 Thread Jason Simms
Hello all, Thanks to several helpful members on this list, I think I have a much better handle on how to upgrade Slurm. Now my question is, do most of you upgrade with each major release? I recognize that, normally, if something is working well, then don't upgrade it! In our case, we're running 2

Re: [slurm-users] Defining an empty partition

2020-12-18 Thread Steve Brasier
Thank you Tina, I hadn't realised that would show as "n/a" not "down" in that case (which IMO would have been confusing). For anyone else hitting this I think the minimum you can do is something like: PartitionName=compute Default=YES State=UP Nodes=nosuch NodeName=nosuch The documented approach

Re: [slurm-users] Defining an empty partition

2020-12-18 Thread Tina Friedrich
Yeah, I had that problem as well (trying to set up a partition that didn't have any nodes - they're not here yet). I figured that one can have partitions with nodes that don't exist, though. As in, not even in DNS. I currently have this: [arc-slurm ~]$ sinfo PARTITION AVAILĀ  TIMELIMITĀ  NODES

Re: [slurm-users] Defining an empty partition

2020-12-18 Thread Steve Brasier
Having tried just not even defining any partitions you hit this this check which seems to ensure you can't create a cluster with no nodes. Is it possible to create a control node without any compute nodes, e.g. as part of a s

[slurm-users] Defining an empty partition

2020-12-18 Thread Steve Brasier
Hi all, According to the relevant manpage it's possible to define an empty partition using "Nodes= ". However this doesn't seem to work (slurm 20.2.05): [centos@testohpc-login-0 ~]$ grep -n Partition /etc/slurm/slurm.conf 72:Prior

[slurm-users] Gang scheduling using GPUs ?

2020-12-18 Thread Tilman Schneider
Hello, we are experiencing troubles with gang scheduling once GPUs are added in the consideration. We are using the following slurm.conf settings: ProctrackType=proctrack/cgroup TaskPlugin=task/cgroup SchedulerType=sched/backfill SchedulerTimeSlice=60 SelectType=select/cons_tres SelectTypeParame