[slurm-users] Slurm 23.11 - Unknown system variable 'wsrep_on'

2024-04-02 Thread Russell Jones via slurm-users
Hi all, I am working on upgrading a Slurm cluster from 20 -> 23. I was successfully able to upgrade to 22, however now that I am trying to go from 22 to 23, starting slurmdbd results in the following error being logged: error: mysql_query failed: 1193 Unknown system variable 'wsrep_on' When try

[slurm-users] Re: File-less NVIDIA GeForce 4070 Ti being removed from GRES list

2024-04-02 Thread Reed Dier via slurm-users
Assuming that you have the cuda drivers installed correctly (nvidia-smi for instance), You should create a gres.conf with just this line: > AutoDetect=nvml If that doesn’t automagically begin working, you can increase the verbosity of slurmd with > SlurmdDebug=debug2 It should then print a bu

[slurm-users] File-less NVIDIA GeForce 4070 Ti being removed from GRES list

2024-04-02 Thread Shooktija S N via slurm-users
Hi, I am trying to set up Slurm (version 22.05) on a 3 node cluster each having an NVIDIA GeForce RTX 4070 Ti GPU. I tried to follow along with the GRES setup tutorial on the Schedmd website and added the following (Gres=gpu:RTX4070TI:1) to the Node configuration in /etc/slurm/slurm.conf: NodeNam