Hi Slurm users,

When removing dead nodes or adding new nodes, I've too often made the mistake of forgetting to update correctly the /etc/slurm/topology.conf file.

Therefore I wrote a simple "checktopology" tool to check the consistency of /etc/slurm/topology.conf with the nodelist in /etc/slurm/slurm.conf (as presented by the sinfo command). It's available from my Github project at https://github.com/OleHolmNielsen/Slurm_tools/tree/master/nodes

An example output is:

$ checktopology
Extract nodelist from /etc/slurm/topology.conf
Differences between Slurm nodelist and nodes in /etc/slurm/topology.conf
*** /tmp/nodelist.12694 2019-01-21 10:07:01.468864295 +0100
--- /tmp/topologylist.12694     2019-01-21 10:07:01.365857415 +0100
***************
*** 368,373 ****
--- 368,378 ----
  c088
  c089
  c090
+ c091
+ c092
+ c093
+ c094
+ c095
  d001
  d002
  d003
***************
*** 595,600 ****
--- 600,606 ----
  i048
  i049
  i050
+ i051
  x001
  x002
  x003

Comments and suggestions are most welcome!

FYI: My Slurm Wiki contains available information about adding/removing nodes: https://wiki.fysik.dtu.dk/niflheim/SLURM#add-and-remove-nodes

--
Ole Holm Nielsen
Department of Physics, Technical University of Denmark

Reply via email to