Nice!
Did not know that, good to know, thanks.
On 25/02/2025 10:13, Kevin Buckley
wrote:
On
2025/02/20 21:55, Daniel Letai via slurm-users wrote:
...
Adding AccountingStorageBackupHost pointing to the other node is
of course
possible, but will mean different slurm.conf files which slurm
will complain
about.
Just thought to note that, in general, it is useful to be aware
that one way to avoid Slurm complaining about per-host differences
is to have your slurm.conf Include a file, containing the
different
per-host settings,
So, you have a line
Include /etc/slurm/slurm-acct_strge_backup_host.conf
in the slurm.conf on both hosts,
but have different file content, in this case the address in
the one line
AccountingStorageBackupHost=IP.AD.RE.SS
in the included file on each of the two hosts.
The SlurmCtld won't complain about that, but the SlurmDs will run
against a different config on each of the nodes.
Background:
Older Crays used to have some Slurm infrastructure running on a
node,
"inside the Cray box", that was accessed via different IP
addresses,
depending on whether you were a compute node, so "in-the-box" or
an
"eLogin" node, so "out-of-the-box" and that was how we overcame
that.
We use the same construct now (on newer HPE/Crays) for Account
Gathering
where not all node hardware supports it, and so we can include
AcctGatherEnergyType=acct_gather_energy/none
or
AcctGatherEnergyType=acct_gather_energy/pm_counters
depending on the node.
Same slurm.conf: no complaining from the SlurmCtld.
|
--
slurm-users mailing list -- slurm-users@lists.schedmd.com
To unsubscribe send an email to slurm-users-le...@lists.schedmd.com