Hi Davide,
On 8/22/24 21:30, Davide DelVento via slurm-users wrote:
I am confused by the reported amount of Down and PLND Down by sreport.
According to it, our cluster would have had a significant amount of
downtime, which I know didn't happen (or, according to the documentation
"time that slurmctld was not responding", see
https://slurm.schedmd.com/sreport.html
<https://slurm.schedmd.com/sreport.html>)
Could it be my purge settings causing this problem? How can I check (maybe
in some logs, maybe in the future) if actually slurmctld was not
responding? The expected long-term numbers should be less than the ones
reported for last month when we had an issue with a few nodes....
Which version of Slurm are you using? There was an sreport bug that
should be fixed in 23.11: https://support.schedmd.com/show_bug.cgi?id=17689
/Ole
--
slurm-users mailing list -- slurm-users@lists.schedmd.com
To unsubscribe send an email to slurm-users-le...@lists.schedmd.com