Even for users other than slurm and munge?  It seems strange that 3 of 4 worker 
nodes work with the same UIDs/GIDs as the non-working nodes.

-----Original Message-----
From: slurm-users <slurm-users-boun...@lists.schedmd.com> On Behalf Of 
Christopher Samuel
Sent: Wednesday, April 22, 2020 2:27 PM
To: slurm-users@lists.schedmd.com
Subject: Re: [slurm-users] Munge decode failing on new node

On 4/22/20 12:56 PM, dean.w.schu...@gmail.com wrote:

> There is a third user account on all machines in the cluster that is 
> the user account for using the cluster.  That account has uid 1000 on 
> all four worker nodes, but on the controller it is 1001.  So that is 
> probably why the question marks.

You need to have identical UIDs everywhere for this to work.

I would strongly suggest using something like LDAP to ensure that your users 
have identical representation everywhere.

All the best,
Chris
-- 
   Chris Samuel  :  http://www.csamuel.org/  :  Berkeley, CA, USA



Reply via email to