I checked what you were suggesting: both the controllers can communicate
without any problem to all the nodes.
Today I tried multiple times the dynamics of takeover between the
primary and the backup controller and I noticed that
the first scontrol takeover works perfectly: the backup controlle
Slurm major releases are moving to a six month release cycle. This
change starts with the upcoming Slurm 24.05 release this May. Slurm
24.11 will follow in November 2024. Major releases then continue every
May and November in 2025 and beyond.
There are two main goals of this change:
- Faster