ado: terça-feira, 29 de agosto de 2023 12:40
Para: Slurm User Community List
Assunto: Re: [slurm-users] Slurm Configless error
Hi:
In my experience this usually means the compute node can’t talk to the
slurmctld TCP port on the slurm controller (firewall?), or the controller host
isn’t res
Hi:
In my experience this usually means the compute node can’t talk to the
slurmctld TCP port on the slurm controller (firewall?), or the controller host
isn’t resolving the compute node’s name (short hostname vs FQDN, for example).
I’d look at slurmctld and slurmd logs—you should see a useful
Hi!
I'm encountering the following errors on my node:
Aug 29 12:24:48 n01 slurmd[9484]: error: _fetch_child: failed to fetch remote
configs
Aug 29 12:24:48 n01 slurmd[9483]: error: _establish_configuration: failed to
load configs
Aug 29 12:24:48 n01 slurmd[9483]: error: slurmd initialization