gres has to be specified in both slurm.conf and gres.conf and gres.conf must be present on the node with the gres. I keep a single cluster wide gres.conf and copy it to all nodes just like slurm.conf. Also, after adding a new gres I think both the slurmctld and the slurmd needs to be restarted.
On Thu, Jun 27, 2019 at 9:35 AM Valerio Bellizzomi <vale...@selnet.org> wrote: > > hello, my node has 2 gpus so I have specified gres=gpus:2 but the > scontrol show node displays this: > > State=IDLE+DRAIN > Reason=gres/gpus count too low (1 < 2) > > > > >