Re: [slurm-users] [External] Re: Troubleshooting job stuck in Pending state

2023-12-12 Thread Bernstein, Noam CIV USN NRL (6393) Washington DC (USA)
Presumably what's in the squeue Reason column isn't enough? It's not particularly informative, although it does distinguish "Resources" from "Priority", for example, and it'll also list various partition limits, e.g.
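For anyone who wants to look at the Reason column across all pending jobs at once, here is a rough sketch, not an authoritative recipe. It assumes `squeue` is on the PATH and uses the standard `%i` (job ID) and `%r` (reason) format fields; the `|` separator and the function names are just illustrative choices.

```python
import subprocess
from collections import Counter


def parse_pending_reasons(squeue_output: str) -> dict[str, str]:
    """Map job ID -> Reason from lines of the form 'jobid|reason'."""
    reasons = {}
    for line in squeue_output.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        job_id, _, reason = line.partition("|")
        reasons[job_id.strip()] = reason.strip()
    return reasons


def pending_reasons() -> dict[str, str]:
    """Ask squeue for pending jobs only (assumes squeue is on the PATH)."""
    out = subprocess.run(
        ["squeue", "--noheader", "--states=PENDING", "--format=%i|%r"],
        capture_output=True,
        text=True,
        check=True,
    ).stdout
    return parse_pending_reasons(out)


def summarize(reasons: dict[str, str]) -> Counter:
    """Count how many pending jobs report each reason."""
    return Counter(reasons.values())
```

On a login node, `print(summarize(pending_reasons()))` would give a quick tally of how many jobs are waiting on "Resources" versus "Priority" and so on. It only repackages what squeue already reports, so it won't explain the scheduler's decisions any further than the Reason column does.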

Re: [slurm-users] [External] Re: Troubleshooting job stuck in Pending state

2023-12-12 Thread Davide DelVento
I am not a Slurm expert by any stretch of the imagination, so my answer is not authoritative. That said, I am not aware of any functional equivalent for Slurm, and I would love to learn that I am mistaken!

On Tue, Dec 12, 2023 at 1:39 AM Pacey, Mike wrote:
> Hi Davide,
>
> The jobs do event

Re: [slurm-users] [External] Re: Troubleshooting job stuck in Pending state

2023-12-12 Thread Pacey, Mike
Hi Davide, The jobs do eventually run, but they can take several minutes, or sometimes several hours, to switch to a running state even when there are plenty of resources free immediately. With Grid Engine it was possible to turn on scheduling diagnostics and get a summary of the scheduler’s decisions