Re: [slurm-users] [ext] Re: bufferoverflow in slurmd with acct_gather_energy plugin

2023-08-29 Thread Ole Holm Nielsen
Hi Magnus, On 29-08-2023 13:56, Hagdorn, Magnus Karl Moritz wrote: I'm curious to learn about your energy gathering method: How do you extract node power using IPMI using FreeIPMI (or some other toolset), and how do you configure Slurm for this? We are using the SLURM plugin which is enabled

Re: [slurm-users] Slurm Configless error

2023-08-29 Thread Nicolas Sonoda
Hello Paul! Thank you for the response! It's strange because in the slurmctld log, the error I'm getting is: error: _slurm_rpc_config_request: Rejected request as configless is disabled Thanks! Nícolas From: slurm-users on behalf of Paul Brunk Sent: Tuesday

Re: [slurm-users] Slurm Configless error

2023-08-29 Thread Paul Brunk
Hi: In my experience this usually means the compute node can’t talk to the slurmctld TCP port on the slurm controller (firewall?), or the controller host isn’t resolving the compute node’s name (short hostname vs FQDN, for example). I’d look at slurmctld and slurmd logs—you should see a useful
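
The two checks suggested above (can the compute node reach the slurmctld TCP port, and does each host resolve the other's name) can be sketched roughly as follows. This is a minimal illustration, not something taken from the thread: the helper names `can_connect` and `resolves` are made up for this example, and the port and controller hostname in the usage note are assumptions.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


def resolves(name: str) -> bool:
    """Return True if the name resolves to at least one address."""
    try:
        return len(socket.getaddrinfo(name, None)) > 0
    except socket.gaierror:
        return False
```

On the compute node one might run `can_connect("ctld-host", 6817)`, where `ctld-host` is a placeholder for the controller's hostname and 6817 assumes the site has not changed `SlurmctldPort` from its default. On the controller, `resolves("n01")` checks the node name seen in the original report; trying both the short hostname and the FQDN helps spot the mismatch mentioned above.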

[slurm-users] Slurm Configless error

2023-08-29 Thread Nicolas Sonoda
Hi! I'm encountering the following errors on my node: Aug 29 12:24:48 n01 slurmd[9484]: error: _fetch_child: failed to fetch remote configs Aug 29 12:24:48 n01 slurmd[9483]: error: _establish_configuration: failed to load configs Aug 29 12:24:48 n01 slurmd[9483]: error: slurmd initialization

Re: [slurm-users] [ext] Re: bufferoverflow in slurmd with acct_gather_energy plugin

2023-08-29 Thread Hagdorn, Magnus Karl Moritz
Hi Ole, On Tue, 2023-08-29 at 11:08 +0200, Ole Holm Nielsen wrote: > I'm curious to learn about your energy gathering method: How do you extract node power using IPMI using FreeIPMI (or some other toolset), and how do you configure Slurm for this? We are using the SLURM plugin whic

Re: [slurm-users] bufferoverflow in slurmd with acct_gather_energy plugin

2023-08-29 Thread Ole Holm Nielsen
Hi Magnus, On 8/28/23 10:16, Hagdorn, Magnus Karl Moritz wrote: we recently enabled the energy gathering plugin on using the IPMI gatherer with libfreeipmi. We are running the latest slurm 23.02.4 on rocky 8.5. We are getting sporadic buffer overflows in slurmd when it is trying to query the IPM