Re: [slurm-users] Gres GPU Resource Issue

2020-05-18 Thread Marcus Wagner
Andrew, you could try change it to the following: /etc/slurm/slurm.conf: NodeName=node[1-3]      CPUs=40 RealMemory=48000 Sockets=2 CoresPerSocket=10 ThreadsPerCore=2 Feature="p4000" Gres=gpu:pascal:8 State=UNKNOWN NodeName=node[4-5,7-10] CPUs=8  RealMemory=48000 Sockets=2 CoresPerSocket=4  T

Re: [slurm-users] Gres GPU Resource Issue

2020-05-17 Thread Alex Chekholko
Hi Andrew, I think maybe something is wrong with your slurmd, maybe something missing from your install? On the node (where slurmd is running), you should see a message similar to this in slurmd.log [2020-05-11T14:29:17.766] Gres Name=gpu Type=titanrtx Count=4 ID=7696487 File=/dev/nvidia[0-3] (n

[slurm-users] Gres GPU Resource Issue

2020-05-15 Thread Speer, Andrew
I've run into a bit of an issue when trying to define GPU's in our slurm conf. Any insight is appreciated. Hopefully relevant lines from the configs below. Error: [2020-05-15T16:35:14.862] error: gres_plugin_node_config_unpack: No plugin configured to process GRES data from node node3 (Name:gpu