Hi all!
In short, I'm thinking about housing some slurmd-used libs in an NFS
share, and am curious about the risk that such sharing poses to
slurmds that are running jobs (I'm not concerned about the jobs
themselves here).
For our next Slurm deployment (not a rolling upgrade), our Rocky8
nodes will be 'statelit
Hi Hermann,
no, we don't use --propagate.
in slurm.conf, we set
PropagateResourceLimits=CORE
That in fact means that we do not propagate any limits besides
the core size (excerpt from the slurm.conf manpage):
If neither PropagateResourceLimits or PropagateResourceLimitsExcept
are config
If you use AzureAD as your identity provider, beware that their JWKS JSON
doesn't contain the alg parameter.
We opened an issue (https://bugs.schedmd.com/show_bug.cgi?id=16168) and it
has been confirmed.
As a workaround you can use this jq query to add the alg to the jwks json
that you get from AzureAD:
cu
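For readers without jq at hand, the same idea can be sketched in Python: add an alg field to every key in the JWKS document that lacks one. This is only a sketch, not the jq query from the original message. The function name add_missing_alg, the sample key, and the default value "RS256" are assumptions made for illustration; check which algorithm your provider actually signs with before using it.

```python
import json


def add_missing_alg(jwks: dict, alg: str = "RS256") -> dict:
    """Return a copy of a JWKS document in which every key has an "alg" field.

    Keys that already carry "alg" are left untouched. The default "RS256"
    is an assumption; pass the algorithm your provider really uses.
    """
    keys = [
        key if "alg" in key else {**key, "alg": alg}
        for key in jwks.get("keys", [])
    ]
    return {**jwks, "keys": keys}


if __name__ == "__main__":
    # Hypothetical JWKS fragment without "alg", for demonstration only.
    sample = {"keys": [{"kty": "RSA", "use": "sig", "kid": "example-kid"}]}
    print(json.dumps(add_missing_alg(sample), indent=2))
```

The function returns a new document instead of modifying its input, so the JWKS as downloaded stays available for comparison.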
Hi,
I am trying to configure SLURM to use external authentication for JWT as
described in the documentation.
https://slurm.schedmd.com/jwt.html
JWT authentication worked when I tested the setup for standalone use, but
I am having difficulty with tokens from our OAuth provider.
My first questi
Hi Marcus,
I am not sure if this is helpful but from looking at the source code of
Slurm (line 276 of src/slurmd/slurmstepd/ulimits.c in version 22.05) it
looks like you are explicitly using
"--propagate..."
to set resource limits (the one you see when running
"ulimit -a") on the workers the s
Hi Folks,
has anyone ever stumbled upon an error like this:
slurmstepd: error: Can't propagate RLIMIT_NPROC of 767202 from submit
host: Invalid argument
Does anyone know where that comes from?
Any hints are welcome.
Best
Marcus
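One thing that may be worth checking, purely as an assumption and not something confirmed in this thread: setrlimit(2) fails with EINVAL ("Invalid argument") when the requested soft limit is higher than the hard limit of the calling process. A small Python sketch, run on the compute node, can compare the propagated value from the error message with the local hard limit. The helper name soft_limit_fits is made up for illustration.

```python
import resource


def soft_limit_fits(requested_soft: int, hard: int) -> bool:
    """Return True if requested_soft could be set under the given hard limit.

    Mirrors the setrlimit(2) rule that a soft limit must not exceed the
    hard limit. RLIM_INFINITY is handled explicitly because its numeric
    value is platform dependent.
    """
    if hard == resource.RLIM_INFINITY:
        return True
    if requested_soft == resource.RLIM_INFINITY:
        return False
    return requested_soft <= hard


if __name__ == "__main__":
    requested = 767202  # value taken from the error message above
    _soft, hard = resource.getrlimit(resource.RLIMIT_NPROC)
    print("hard RLIMIT_NPROC on this host:", hard)
    print("propagated value fits:", soft_limit_fits(requested, hard))
```

If this prints False on a compute node, the submit host's soft limit is larger than what the node allows, which would be consistent with the error. It does not prove that this is the cause.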