Re: [slurm-users] After reboot nodes are in state = down

2019-09-27 Thread Rafał Kędziorski
OK, thanks for the explanation.
On Fri, 27 Sep 2019 at 15:38, Steffen Grunewald <steffen.grunew...@aei.mpg.de> wrote:
> On Fri, 2019-09-27 at 14:58:40 +0200, Rafał Kędziorski wrote:
> > On Fri, 27 Sep 2019 at 13:50, Steffen Grunewald <steffen.grunew...@aei.mpg.de> wrote:
> > >

Re: [slurm-users] After reboot nodes are in state = down

2019-09-27 Thread Steffen Grunewald
On Fri, 2019-09-27 at 14:58:40 +0200, Rafał Kędziorski wrote:
> On Fri, 27 Sep 2019 at 13:50, Steffen Grunewald <steffen.grunew...@aei.mpg.de> wrote:
> > On Fri, 2019-09-27 at 11:19:16 +0200, Juergen Salk wrote:
> > >
> > > you may try setting `ReturnToService=2` in slurm.conf.
> > >

Re: [slurm-users] After reboot nodes are in state = down

2019-09-27 Thread Juergen Salk
* Rafał Kędziorski [190927 14:58]:
> > > you may try setting `ReturnToService=2` in slurm.conf.
> >
> > Caveat: A spontaneously rebooting machine may create a "black hole" this way.
>
> What do you mean by this? Could ReturnToService=2 be a problem?
>
Hi Rafał, black hole syndr

Re: [slurm-users] After reboot nodes are in state = down

2019-09-27 Thread Rafał Kędziorski
On Fri, 27 Sep 2019 at 13:50, Steffen Grunewald <steffen.grunew...@aei.mpg.de> wrote:
> On Fri, 2019-09-27 at 11:19:16 +0200, Juergen Salk wrote:
> > Hi Rafał,
> >
> > you may try setting `ReturnToService=2` in slurm.conf.
> >
> > Best regards
> > Jürgen
>
> Caveat: A spontaneously reboot

Re: [slurm-users] After reboot nodes are in state = down

2019-09-27 Thread Steffen Grunewald
On Fri, 2019-09-27 at 11:19:16 +0200, Juergen Salk wrote:
> Hi Rafał,
>
> you may try setting `ReturnToService=2` in slurm.conf.
>
> Best regards
> Jürgen
Caveat: A spontaneously rebooting machine may create a "black hole" this way.
- Steffen
--
Steffen Grunewald, Cluster Administrator Max P

Re: [slurm-users] Monitoring with Telegraf

2019-09-27 Thread Josef Dvoracek
Some time ago I wrote this small collector: https://github.com/jose-d/influxdb-collectors/tree/master/slurm_metric_writer
Until you write or find a better one, feel free to use it, send PRs with improvements, etc. :)
Cheers,
Josef
On 26. 09. 19 17:15, Marcus Boden wrote: Hey everyone, I am

Re: [slurm-users] After reboot nodes are in state = down

2019-09-27 Thread Juergen Salk
Hi Rafał,
you may try setting `ReturnToService=2` in slurm.conf.
Best regards
Jürgen
--
Jürgen Salk
Scientific Software & Compute Services (SSCS)
Kommunikations- und Informationszentrum (kiz)
Universität Ulm
Phone: +49 (0)731 50-22478
Fax: +49 (0)731 50-22471
* Rafał Kędziorski [190927
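
For readers following the thread, the setting suggested here could look roughly like the fragment below. This is a minimal sketch and not taken from any poster's actual configuration; the comments paraphrase the meaning of the three values as described in the slurm.conf documentation.

```
# slurm.conf (fragment): illustrative only, not copied from the thread
#
# ReturnToService controls when a DOWN node goes back into service:
#   0  the node stays DOWN until an administrator changes its state
#   1  the node returns only if it was set DOWN for being non-responsive
#   2  the node returns as soon as it registers with a valid configuration,
#      whatever the reason it was set DOWN
ReturnToService=2
```

The file normally has to be identical on the controller and all compute nodes, and the daemons need to pick up the change (for example via `scontrol reconfigure`). Without this setting, a node that comes back DOWN after a reboot can be returned to service by hand with `scontrol update NodeName=<node> State=RESUME`, where `<node>` is a placeholder for the real node name. Steffen's caveat elsewhere in the thread applies to value 2: a node that reboots on its own will also rejoin automatically.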

Re: [slurm-users] After reboot nodes are in state = down

2019-09-27 Thread Rafał Kędziorski
Hi Andreas, my cluster is not running all the time. I just call sudo shutdown, and after booting, the nodes are in state down. I'm using Slurm on a Raspberry Pi cluster (5 × Pi 4). What is the best way to shut down the nodes so that they are idle and not down after booting? Regards, Rafal On Fri, 27 Sep 2019