Andrew,
You could try changing it to the following:
/etc/slurm/slurm.conf:
NodeName=node[1-3] CPUs=40 RealMemory=48000 Sockets=2 CoresPerSocket=10 ThreadsPerCore=2 Feature="p4000" Gres=gpu:pascal:8 State=UNKNOWN
NodeName=node[4-5,7-10] CPUs=8 RealMemory=48000 Sockets=2 CoresPerSocket=4 T
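
As a quick sanity check on a node definition like the first one above, here is a minimal Python sketch that tests whether CPUs matches Sockets * CoresPerSocket * ThreadsPerCore. The helper names (parse_node_line, cpus_consistent) are made up for illustration. It assumes the keys are spelled exactly as shown and that CPUs is meant to count hardware threads, which may not be true for every site.

```python
def parse_node_line(line):
    """Split a slurm.conf NodeName line into a {key: value} dict.

    Assumption: the whole definition is on one line and every
    parameter has the form Key=Value, separated by whitespace.
    """
    return dict(tok.split("=", 1) for tok in line.split() if "=" in tok)


def cpus_consistent(line):
    """Return True if CPUs == Sockets * CoresPerSocket * ThreadsPerCore."""
    params = parse_node_line(line)
    expected = (
        int(params["Sockets"])
        * int(params["CoresPerSocket"])
        * int(params["ThreadsPerCore"])
    )
    return int(params["CPUs"]) == expected


# The node[1-3] definition from the message above, joined onto one line.
line = (
    'NodeName=node[1-3] CPUs=40 RealMemory=48000 Sockets=2 '
    'CoresPerSocket=10 ThreadsPerCore=2 Feature="p4000" '
    'Gres=gpu:pascal:8 State=UNKNOWN'
)
print(cpus_consistent(line))
```

For node[1-3] the numbers agree (2 * 10 * 2 = 40), so the topology part of that line looks self-consistent.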
Hi Andrew,
I think something may be wrong with your slurmd; perhaps something is missing
from your install?
On the node (where slurmd is running), you should see a message similar to
this in slurmd.log:
[2020-05-11T14:29:17.766] Gres Name=gpu Type=titanrtx Count=4 ID=7696487
File=/dev/nvidia[0-3] (n
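
If it helps to check this on several nodes, here is a minimal Python sketch that pulls the "Gres Name=..." entries out of slurmd.log lines. The regular expression is only based on the sample line above, and find_gres_lines is a made-up helper name. Other Slurm versions may format this message differently.

```python
import re

# Assumption: the log line looks like the sample quoted above, i.e.
# "Gres Name=<name> Type=<type> Count=<n> ...".
GRES_RE = re.compile(r"Gres Name=(\S+) Type=(\S+) Count=(\d+)")


def find_gres_lines(lines):
    """Return (name, type, count) for every GRES line found in `lines`."""
    found = []
    for line in lines:
        match = GRES_RE.search(line)
        if match:
            found.append((match.group(1), match.group(2), int(match.group(3))))
    return found


# Only the complete part of the sample log line is used here.
sample = [
    "[2020-05-11T14:29:17.766] Gres Name=gpu Type=titanrtx Count=4 ID=7696487",
]
print(find_gres_lines(sample))
```

To run it against a real log, open the file your SlurmdLogFile setting points to and pass the file object to find_gres_lines. An empty result on a GPU node would suggest slurmd is not picking up the GRES definition at all.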
I've run into a bit of an issue when trying to define GPUs in our Slurm config.
Any insight is appreciated.
Hopefully relevant lines from the configs below.
Error:
[2020-05-15T16:35:14.862] error: gres_plugin_node_config_unpack: No plugin
configured to process GRES data from node node3 (Name:gpu