First, the gpu is already set shared mode. I can run job using gpu with the following slurm configuration, I have one job using 1 gpu, I can see CUDA_VISIBLE_DEVICE in the job env. If I want to run another job using the 1 gpus, the job will be pending. How to configure so that I can run multi job on the same gpus? I noticed :no_consume can be added to the Gres, at this time, I can run multi jobs, but there is no CUDA_VISIBLE_DEVICE can be found in the job env. Slurm.conf NodeName=node1 Gres=gpu:1 CPUs=4 State=UNKNOWN
Thanks. Jeff (ChaoFeng Zhang, 张超锋) PMP® zhang...@lenovo.com<mailto:zhang...@lenovo.com> HPC&AI | Cloud Software Architect (+86) - 18116117420 Software solution development (+8621) - 20590223 Shanghai, China