[slurm-users] Re: Slurm fails before nvidia-smi command

2024-07-29 Thread Aziz Ogutlu via slurm-users
Jeff
From: Aziz Ogutlu via slurm-users
Sent: Monday, July 29, 2024 3:23 AM
To: slurm-us...@schedmd.com
Subject: [slurm-users] Slurm fails before nvidia-smi command
Hi there all, We have a Dell server with 2 x Nvidia H100 and are running Slurm on it. After r

[slurm-users] Slurm fails before nvidia-smi command

2024-07-29 Thread Aziz Ogutlu via slurm-users
Hi there all, We have a Dell server with 2 x Nvidia H100 GPUs and are running Slurm on it. After a server restart, Slurm fails unless we first run the nvidia-smi command. When we run nvidia-smi && systemctl restart slurmd && systemctl restart slurmctld, the Slurm queue starts running. Do you have any idea about this error
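
The workaround described in this message could be wrapped in a small script, sketched below. It only repeats the three commands from the post in the same order and does not address the underlying cause. The function names `run` and `gpu_then_slurm` and the `DRY_RUN` variable are hypothetical and exist only for this illustration.

```shell
#!/bin/sh
# Sketch of the workaround from the post: run nvidia-smi first, then
# restart the Slurm daemons. Names here are illustrative assumptions.

# run: execute a command, or only print it when DRY_RUN=1 (hypothetical switch).
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# gpu_then_slurm: same command order as in the post; stops at the first failure.
gpu_then_slurm() {
    run nvidia-smi &&
    run systemctl restart slurmd &&
    run systemctl restart slurmctld
}
```

Calling `DRY_RUN=1 gpu_then_slurm` prints the three commands without executing them; calling `gpu_then_slurm` as root runs them for real.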

[slurm-users] MPI_Init_thread error

2023-07-24 Thread Aziz Ogutlu
Hi there all, We're using Slurm 21.08 on a Red Hat 7.9 HPC cluster with OpenMPI 4.0.3 + gcc 8.5.0. When we run the commands below to call SU2, we get an error message:
$ srun -p defq --nodes=1 --ntasks-per-node=1 --time=01:00:00 --pty bash -i
$ module load su2/7.5.1
$ SU2_CFD config.cfg
*** An