Thank you for your reply and apologies for not reacting sooner I have
kept busy until now. I have attached our partition definitions to this
mail.
As for your second question MPI jobs aren't really a issue in our
cluster there are a few in between but not nearly enough to explain up
to 20 nod
We’ve run a similar setup since I moved to Slurm 3 years ago, with no issues.
Could you share partition definitions from your slurm.conf?
When you see a bunch of jobs pending, which ones have a reason of “Resources”?
Those should be the next ones to run, and ones with a reason of “Priority” are