Hi all,
I'm trying to get the gathering of gres/gpumem and gres/gpuutil
working on Slurm 23.02.2, but with no success so far.
We have:
AccountingStorageTRES=cpu,mem,gres/gpu
in slurm.conf, and Slurm is built with NVML support.
Autodetect=NVML
in gres.conf
gres/gpumem and gres/gpuutil now a
Hi,
we want to push our users to run jobs with high GPU utilization.
Because it's difficult for users to get the GPU utilization of their jobs, I
have decided to write a script that prints the utilization of running jobs.
The idea is simple:
1. get list of running jobs in GPU partitions
2. get ID
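
A rough sketch of step 1 only, in case it helps anyone following along. It is not the script described above. It assumes squeue is on the PATH, and the partition names in the usage note ("gpu", "gpu-long") are made up; replace them with the names from your own site. Job IDs are kept as strings because array tasks are printed as "jobid_index".

```python
import subprocess


def squeue_command(partitions):
    """Build an squeue call that prints one running job ID per line."""
    return [
        "squeue",
        "--noheader",                           # no header line in the output
        "--states=RUNNING",                     # running jobs only
        "--partition=" + ",".join(partitions),  # comma-separated partition list
        "--format=%i",                          # %i = job ID (array tasks as jobid_index)
    ]


def parse_job_ids(squeue_output):
    """Turn squeue output into a list of job ID strings, skipping blank lines."""
    return [line.strip() for line in squeue_output.splitlines() if line.strip()]


def running_gpu_jobs(partitions):
    """Run squeue and return the IDs of running jobs in the given partitions."""
    result = subprocess.run(
        squeue_command(partitions),
        capture_output=True,
        text=True,
        check=True,  # raise if squeue exits with an error
    )
    return parse_job_ids(result.stdout)
```

On a cluster this would be called as running_gpu_jobs(["gpu", "gpu-long"]), with the hypothetical partition names swapped for real ones. Parsing is kept in its own function so it can be checked without a live Slurm installation.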