All,

A simple user SLURM question. Recently I had a need to request a certain range 
of nodes on a cluster, and I found SLURM was acting against my assumption, but 
exactly as the man page says.

Namely, I needed say, 20 nodes on a cluster on the same rack (benchmarking!), 
but I suspected that some node was down for various reasons. So that if I tried 
something a la:

  sbatch -N 20 --nodelist=nodeX[001-020]

the job would be stuck pending. My naïve thought was oh, okay, I'll just ask 
for a bigger swath, like the whole rack (say 50 nodes):

 sbatch -N 20 --nodelist=nodeX[001-050]

and it will give me 20 nodes within that range, and skip the one that's down. 
But that lead to:

  sbatch: error: invalid number of nodes (-N 50-20)

I then read the man page and found, that nodelist will let you do the "under":

  sbatch -N 20 --nodelist=nodeX[001-010]

but not the "over". Instead I needed to do:

  sbatch -N 20 --nodelist=nodeX[001-011,13-21]

after finding out, for example, node 12 was down.

So, my question is, is there a way to say, "Please give me X nodes inside this 
specific range of nodes?"

Thanks,
Matt
--
Matt Thompson, SSAI, Ld Scientific Programmer/Analyst
NASA GSFC,    Global Modeling and Assimilation Office
Code 610.1,  8800 Greenbelt Rd,  Greenbelt,  MD 20771
Phone: 301-614-6712                 Fax: 301-614-6246
http://science.gsfc.nasa.gov/sed/bio/matthew.thompson
  • [slurm-u... Thompson, Matt (GSFC-610.1)[SCIENCE SYSTEMS AND APPLICATIONS INC]
    • Re:... Thompson, Matt (GSFC-610.1)[SCIENCE SYSTEMS AND APPLICATIONS INC]
      • ... Em Dragowsky

Reply via email to