Vladimir, A while back the best cluster monitoring tool was Ganglia ( http://ganglia.sourceforge.net/), but it has not been maintained for several years. There are quite a few alternatives out there, I found nightingale (https://github.com/didi/nightingale) to be simple to install and use.
Good luck, George. On Fri, Apr 8, 2022 at 6:09 AM Vladimir Nikishkin via users < users@lists.open-mpi.org> wrote: > Hello, everyone. > > Sorry if my message is somehow off-topic, but googling returns too many > results, rather than too few, so I would like to ask for someone's > personal experience. > > So, I have a cluster of a few identically set up machines with a shared > NFS space. > I would like to have some visualisation of how this cluster is used. > E.g., how many machines are up, how many are down, how much memory is > available on each node, how the cluster performance changes with time > (e.g. total bogomips, total memory), ping to each machine, et cetera. > > Can someone recommend some openmpi-oriented monitoring software for such > a use case? > > -- > Your sincerely, > Vladimir Nikishkin (MiEr, lockywolf) > (Laptop) >