[slurm-users] slurmd-used libs in an NFS share?

2023-03-23 Thread Paul Brunk
Hi all! In short, I'm thinking about housing some slurmd-used libs in an NFS share, and am curious about the risk such sharedness offers to job-running slurmds (not concerned about the jobs themselves here). For our next Slurm deployment (not a rolling upgrade), our Rocky8 nodes will be 'statelit

Re: [slurm-users] RLIMIT_NPROCS

2023-03-23 Thread Wagner, Marcus
Hi Hermann, no, we don't use --propagate. in slurm.conf, we set PropagateResourceLimits=CORE That in fact means, that we really do not propagate any limits besides the coresize (excerpt from slurm.conf manpage): If neither PropagateResourceLimits or PropagateResourceLimitsExcept are config

Re: [slurm-users] External Authentication Integration with JWKS and RS256 Tokens

2023-03-23 Thread Ümit Seren
If you use AzureAD as your identity provider beware that their JWKS json doesn't contain the alg parameter. We opened an issue: https://bugs.schedmd.com/show_bug.cgi?id=16168 and it is confirmed. As a workaround you can use this jq query to add the alg to the jwks json that you get from AzureAD: cu

[slurm-users] External Authentication Integration with JWKS and RS256 Tokens

2023-03-23 Thread Laurence
Hi, I am trying to configure SLURM to use external authentication for JWT as described in the documentation. https://slurm.schedmd.com/jwt.html JWT Authentication worked when I tested the setup for standalone use but am having difficulty with tokens from our oauth provider. My first questi

Re: [slurm-users] RLIMIT_NPROCS

2023-03-23 Thread Hermann Schwärzler
Hi Marcus, I am not sure if this is helpful but from looking at the source code of Slurm (line 276 of src/slurmd/slurmstepd/ulimits.c in version 22.05) it looks like you are explicitly using "--propagate..." to set resource limits (the one you see when running "ulimit -a") on the workers the s

[slurm-users] RLIMIT_NPROCS

2023-03-23 Thread Wagner, Marcus
Hi Folks, has anyone ever stumbled upon such an error: slurmstepd: error: Can't propagate RLIMIT_NPROC of 767202 from submit host: Invalid argument Anyone knows, where that comes from? Any hints are welcome. Best Marcus smime.p7s Description: S/MIME Cryptographic Signature