Re: [slurm-users] Monitoring with Telegraf

2019-09-30 Thread Pablo Llopis
Hi all, If you're using collectd to gather metrics I started writing a slurm collectd plugin at https://github.com/pllopis/collectd/tree/slurm. It provides per-partition info about jobs, node stats, and internal slurm metrics such as backfill stats. In our infra these are shipped to Influx and th

Re: [slurm-users] Monitoring with Telegraf

2019-09-27 Thread Josef Dvoracek
some time ago I wrote this small collector, https://github.com/jose-d/influxdb-collectors/tree/master/slurm_metric_writer. Until you'll write/find better one, feel free to use it, send PRs with improvements, etc :) cheers. josef On 26. 09. 19 17:15, Marcus Boden wrote: Hey everyone, I am

Re: [slurm-users] Monitoring with Telegraf

2019-09-26 Thread Tina Friedrich
I second that question - I'm using the same combination :) I know there's some efforts - see https://slurm.schedmd.com/SLUG16/monitoring_influxdb_slug.pdf - but I don't know exactly what the state of that is at the moment. (I resorted to telegraf's 'execute script' plugin to pump some informat

[slurm-users] Monitoring with Telegraf

2019-09-26 Thread Marcus Boden
Hey everyone, I am using Telegraf and InfluxDB to monitor our hardware and I'd like to include some slurm metrics into this. Is there already a telegraf plugin for monitoring slurm I don't know about, or do I have to start from scratch? Best, Marcus -- Marcus Vincent Boden, M.Sc. Arbeitsgruppe e