First, the gpu is already set shared mode.

I can run job using gpu with the following slurm configuration, I  have one job 
using 1 gpu, I can see CUDA_VISIBLE_DEVICE in the job env.  If I want to run 
another job using the 1 gpus, the job will be pending. How to configure so that 
I can run multi job on the same gpus?
I noticed :no_consume can be added to the Gres, at this time, I can run multi 
jobs, but there is no CUDA_VISIBLE_DEVICE can be found in the job env.
Slurm.conf
NodeName=node1 Gres=gpu:1 CPUs=4 State=UNKNOWN

Thanks.

Jeff (ChaoFeng Zhang, 张超锋) PMP®             
zhang...@lenovo.com<mailto:zhang...@lenovo.com>
HPC&AI | Cloud Software Architect                       (+86) - 18116117420
Software solution development                    (+8621) - 20590223
Shanghai, China

Reply via email to