All, A simple user SLURM question. Recently I had a need to request a certain range of nodes on a cluster, and I found SLURM was acting against my assumption, but exactly as the man page says.
Namely, I needed say, 20 nodes on a cluster on the same rack (benchmarking!), but I suspected that some node was down for various reasons. So that if I tried something a la: sbatch -N 20 --nodelist=nodeX[001-020] the job would be stuck pending. My naïve thought was oh, okay, I'll just ask for a bigger swath, like the whole rack (say 50 nodes): sbatch -N 20 --nodelist=nodeX[001-050] and it will give me 20 nodes within that range, and skip the one that's down. But that lead to: sbatch: error: invalid number of nodes (-N 50-20) I then read the man page and found, that nodelist will let you do the "under": sbatch -N 20 --nodelist=nodeX[001-010] but not the "over". Instead I needed to do: sbatch -N 20 --nodelist=nodeX[001-011,13-21] after finding out, for example, node 12 was down. So, my question is, is there a way to say, "Please give me X nodes inside this specific range of nodes?" Thanks, Matt -- Matt Thompson, SSAI, Ld Scientific Programmer/Analyst NASA GSFC, Global Modeling and Assimilation Office Code 610.1, 8800 Greenbelt Rd, Greenbelt, MD 20771 Phone: 301-614-6712 Fax: 301-614-6246 http://science.gsfc.nasa.gov/sed/bio/matthew.thompson