close compute node hosts that show
the error. A reboot clears the condition.
-Original Message-
From: slurm-users On Behalf Of Matthew
BETTINGER
Sent: Tuesday, July 28, 2020 12:53 AM
To: Slurm User Community List
Subject: [slurm-users] Slurmstepd errors
Hello,
Running slurm 17.02.6
Hello,
Running slurm 17.02.6 on a cray system and all of a sudden we have been
receiving these message errors from slurmstepd. Not sure what triggers this?
srun -N 4 -n 4 hostname
nid00031
slurmstepd: error: task/cgroup: unable to add task[pid=903] to memory cg
'(null)'
nid00029
nid00030
slurm