On 11/16/21 7:07 am, Jaep Emmanuel wrote:

> root@ecpsc10:~# scontrol show node ecpsc10
[...]
> State=DOWN ThreadsPerCore=1 TmpDisk=0 Weight=1 Owner=N/A MCS_label=N/A
[...]
> Reason=Node unexpectedly rebooted [slurm@2021-11-16T14:41:04]

This is why the node isn't considered available. As others have already noted, you will need to resume the node.

All the best,
Chris
--
Chris Samuel : http://www.csamuel.org/ : Berkeley, CA, USA
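
For reference, resuming the node mentioned above would typically be done with `scontrol update`. The sketch below only prints the command so it can be reviewed before use. `resume_node_cmd` is a hypothetical helper introduced for illustration, not part of Slurm, and the node name is taken from the quoted output.

```shell
#!/bin/sh
# Sketch only: resume_node_cmd is a hypothetical helper, not a Slurm tool.
# It prints the scontrol command that clears the DOWN state on a node.
resume_node_cmd() {
    printf 'scontrol update NodeName=%s State=RESUME\n' "$1"
}

# Node name taken from the quoted output; this prints the command
# and does not run it.
resume_node_cmd ecpsc10

# To apply it, run the printed command as root or the Slurm admin user,
# then confirm the new state with:  scontrol show node ecpsc10
```

Running the printed command should clear the DOWN state, provided slurmd on the node is up and has registered with the controller.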