Hello,

 

I have a heterogeneous GPU Node with one V100 and two RTX cards. When I
request resources with --gres=mps:100, always the V100 is chosen, and jobs
are waiting if the V100 is completely allocated, while RTX cards are free.
If I use --gres=gpu:1, also the RTX cards are used. Is something wrong with
the configuration or is it another problem?

 

The node configuration  in slurm.conf:

NodeName=node1 CPUs=48 RealMemory=128530 Sockets=1 CoresPerSocket=24
ThreadsPerCore=2 Gres=gpu:v100:1,gpu:rtx:2,mps:600 State=UNKNOWN

 

gres.conf:

Name=gpu Type=v100      File=/dev/nvidia0

Name=gpu Type=rtx          File=/dev/nvidia1

Name=gpu Type=rtx          File=/dev/nvidia2

Name=mps Count=200      File=/dev/nvidia0

Name=mps Count=200      File=/dev/nvidia1

Name=mps Count=200      File=/dev/nvidia2

 

Best regards,

Holger

Reply via email to