[slurm-users] not able to run mpi jobs

2022-03-22 Thread masber masber
Dear slurm community, I am quite new to slurm but I got a small slurm cluster with 3 compute nodes running. I can run simple jobs like `srun -N3 hostname` and I am trying now to run an mpi helloworld app. My issue is that the job hangs and fails after a few seconds. # srun -N2 -n4 /scratch/hel

[slurm-users] step creation temporarily disabled, retrying (Requested nodes are busy)

2022-03-01 Thread masber masber
Dear slurm user community, I have a slurm cluster on centos7 installed through yum, I also have mpich installed. I can ssh into on of the nodes and run an mpi job: # /usr/lib64/mpich/bin/mpirun --hosts nid001001-bae562bc0bd98e50ad5c03200efaf799d6e82469,nid001002-bae562bc0bd98e50ad5c03200efaf79