Re: [slurm-users] unable to ssh onto compute nodes on which I have running jobs

2022-08-03 Thread byron
Thanks for everyones help. All I needed to do was compile a new version of pam_slurm.so. I'm aware there's a newer slurm_pam_adopt but everything was already setup for pam_slurm.so so I just went with that. Regards Lloyd On Wed, Jul 27, 2022 at 9:45 PM Bernd Melchers wrote: > >This happen

Re: [slurm-users] unable to ssh onto compute nodes on which I have running jobs

2022-07-27 Thread Bernd Melchers
>This happens on all our compute nodes. >I can't find any mention of slurm_pam_adopt in /etc/pamd.d. All I have >is in sshd, account required pam_slurm.so. We had a similar problem, caused by wrong access bits for ssh host key files in /etc/ssh/ now we have -rw-r--r-- root root fo

Re: [slurm-users] unable to ssh onto compute nodes on which I have running jobs

2022-07-27 Thread Fulcomer, Samuel
>From our /etc/pam.d/sshd on our compute nodes accountrequired pam_nologin.so accountsufficientpam_access.so accountinclude password-auth -accountrequired pam_slurm_adopt.so and /pam.d/password-auth: #-session optional pam_systemd.so Note that di

Re: [slurm-users] unable to ssh onto compute nodes on which I have running jobs

2022-07-27 Thread byron
This happens on all our compute nodes. I can't find any mention of slurm_pam_adopt in /etc/pamd.d. All I have is in sshd, account required pam_slurm.so. On Wed, Jul 27, 2022 at 5:52 PM Brian Andrus wrote: > Lloyd, > > You could check out the order of entries in your pam.d/ssh (and > related/i

Re: [slurm-users] unable to ssh onto compute nodes on which I have running jobs

2022-07-27 Thread Brian Andrus
Lloyd, You couldĀ  check out the order of entries in your pam.d/ssh (and related/included) files See where the slurm_pam_adopt is, how it is being called and if there are settings that are interferring. Does this occur only on a single node, or all of them? Brian Andrus On 7/27/2022 9:29 A

Re: [slurm-users] unable to ssh onto compute nodes on which I have running jobs

2022-07-27 Thread Lloyd Goodman
I don't think that's the source of the problem. All our user accounts are centrally managed using sssd. And just to be sure I run "getent passwd " on the management, head and compute nodes and they all returned the same values On Wed, 27 Jul 2022 at 17:22, Brian Andrus wrote: > Verify that the

Re: [slurm-users] unable to ssh onto compute nodes on which I have running jobs

2022-07-27 Thread Brian Andrus
Verify that their uid on the node is the same as the uid your master sees Brian Andrus On 7/27/2022 8:53 AM, byron wrote: Hi When a user tries to login into a compute node on which they have a running job they get the error Access denied: user blahblah (uid=) has no active jobs on this

[slurm-users] unable to ssh onto compute nodes on which I have running jobs

2022-07-27 Thread byron
Hi When a user tries to login into a compute node on which they have a running job they get the error Access denied: user blahblah (uid=) has no active jobs on this node. Authentication failed. I recently upgraded slurm to 20.11.9 and was under the impression that prior to the upgrade they w