Re: [slurm-users] New Bright Cluster Slurm issue for AD users

2019-02-22 Thread Yugendra Guvvala
Just to close the loop on this: this was not a Slurm issue, it was an AD configuration issue. AD integration needs to be set up on all nodes of the cluster so that Slurm can resolve the user ID. I had trouble with missing sssd DB folders and with the permissions on the sssd.conf file, so look out for those. You
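The two things mentioned above (missing sssd DB folders and sssd.conf permissions) can be checked with a short script on each node. This is only a sketch under stated assumptions: the paths `/etc/sssd/sssd.conf` and `/var/lib/sss/db` are the usual defaults and are not given in the thread, the function name `check_sssd_files` is made up for illustration, and the "owned by root, mode 0600" rule reflects what the sssd.conf man page of that era asks for; your version may differ.

```python
import os
import stat


def check_sssd_files(conf_path="/etc/sssd/sssd.conf",
                     db_dirs=("/var/lib/sss/db",),
                     expected_uid=0):
    """Return a list of human-readable problems; an empty list means OK.

    All default paths are assumptions (common distribution defaults),
    not values taken from the thread.
    """
    problems = []

    try:
        st = os.stat(conf_path)
    except FileNotFoundError:
        problems.append(f"{conf_path} is missing")
    else:
        mode = stat.S_IMODE(st.st_mode)
        # Any group/other permission bit set means the file is too open.
        if mode & 0o077:
            problems.append(
                f"{conf_path} has mode {oct(mode)}, expected 0o600")
        if st.st_uid != expected_uid:
            problems.append(
                f"{conf_path} is owned by uid {st.st_uid}, "
                f"expected uid {expected_uid}")

    for directory in db_dirs:
        if not os.path.isdir(directory):
            problems.append(f"directory {directory} is missing")

    return problems
```

Running `print(check_sssd_files())` as root on a compute node would list anything that looks off; an empty list only means these particular checks passed, not that the domain join itself is healthy.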

Re: [slurm-users] New Bright Cluster Slurm issue for AD users

2019-02-13 Thread John Hearns
Matthew, that deserves an explanation. Bright Computing Proof of Concept causes nightmares? That is a pretty strong assertion. Please give more details. On Wed, 13 Feb 2019 at 16:01, Matthew BETTINGER <matthew.bettin...@external.total.com> wrote: > One of the main guys, Panos, left Bright, so no ans

Re: [slurm-users] New Bright Cluster Slurm issue for AD users

2019-02-13 Thread Matthew BETTINGER
One of the main guys, Panos, left Bright, so no answer to your specific question, but I hope you can get some support with it. We dumped our BC PoC; the sysadmin working on the PoC still has nightmares. On 2/13/19, 6:54 AM, "slurm-users on behalf of John Hearns" wrote: Yugendra, the Brigh

Re: [slurm-users] New Bright Cluster Slurm issue for AD users

2019-02-13 Thread Antony Cleave
One method I've used many times in Bright is to integrate a compute node in the same way as the master and logins (I normally use realm join...) and then grab the changes back into the image in cmsh. If you are worried, you can clone into a new image. Then you can make sure your compute nodes

Re: [slurm-users] New Bright Cluster Slurm issue for AD users

2019-02-13 Thread Yugendra Guvvala
Thanks Guys. I will go through all resources and report back how it goes. Thanks, Yugi > On Feb 13, 2019, at 7:58 AM, John Hearns wrote: > > please have a look at section 6.3 of the Bright Admin Manual > You have run updateprovisioners then rebooted the nodes? > > > Configuring The Cluster To

Re: [slurm-users] New Bright Cluster Slurm issue for AD users

2019-02-13 Thread John Hearns
please have a look at section 6.3 of the Bright Admin Manual You have run updateprovisioners then rebooted the nodes? Configuring The Cluster To Authenticate Against An External LDAP Server The cluster can be configured in different ways to authenticate against an external LDAP server. For smaller

Re: [slurm-users] New Bright Cluster Slurm issue for AD users

2019-02-13 Thread John Hearns
Yugendra, the Bright support guys are excellent. Slurm is their default choice. I would ask again. Yes, Slurm is technically out of scope for them, but they should help a bit. By the way, I think your problem is that you have configured authentication using AD on your head node, BUT you have not

Re: [slurm-users] New Bright Cluster Slurm issue for AD users

2019-02-13 Thread Antony Cleave
Can you ssh in as root and then su to the AD user to make sure that the node is integrated correctly? If you cannot su to an AD user on the node, then Slurm will not be able to resolve the UID either, as they use the same methods. On Wed, 13 Feb 2019, 12:35 Yugendra Guvvala, wrote: > No, we can’t
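The same lookup that `su` relies on can also be tried directly from a script run as root on the compute node. The sketch below uses Python's standard `pwd` module, which goes through the system's name service configuration; the function name `resolve_user` and the example account name are hypothetical, and this is an illustration of the check, not a statement of how Slurm does the lookup internally.

```python
import pwd


def resolve_user(username):
    """Return (uid, gid) if the node can resolve the name, else None."""
    try:
        entry = pwd.getpwnam(username)
    except KeyError:
        # The node's user database (local files, sssd, ...) has no
        # entry for this name.
        return None
    return entry.pw_uid, entry.pw_gid
```

For example, `resolve_user("some_ad_user")` returning `None` on a compute node while returning a UID on the head node would point at the node not being integrated with AD, which matches the diagnosis suggested above.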

Re: [slurm-users] New Bright Cluster Slurm issue for AD users

2019-02-13 Thread Loris Bennett
Yugendra Guvvala writes: >> On Feb 13, 2019, at 1:50 AM, Loris Bennett >> wrote: >> >> Yugendra Guvvala writes: >> >>> Hi, >>> >>> We are bringing a new cluster online. We installed SLURM through Bright >>> Cluster Manager; however, we are running into an issue here. >>> >>> We are able

Re: [slurm-users] New Bright Cluster Slurm issue for AD users

2019-02-13 Thread Ole Holm Nielsen
On 2/13/19 1:14 PM, Yugendra Guvvala wrote: Thank you, this is strange. Is there a way to integrate AD authentication with SLURM or Munge? Or to allow all users who log in to run jobs without any restrictions? Integration of Slurm with LDAP accounts has been implemented at EPFL in Switzerland

Re: [slurm-users] New Bright Cluster Slurm issue for AD users

2019-02-13 Thread Yugendra Guvvala
I also reached out to Bright Computing support, and they say Slurm is out of scope for them. Thanks, Yugi > On Feb 13, 2019, at 7:27 AM, Antony Cleave wrote: > > can you ssh to the compute node that job was trying to run on as the AD > user in question? > > I've seen similar issues on AD i

Re: [slurm-users] New Bright Cluster Slurm issue for AD users

2019-02-13 Thread Yugendra Guvvala
No, we can’t ssh to compute nodes. This is by design: no one other than root should be able to ssh to compute nodes. I figure that munge is not configured for AD. We have configured our login image for AD, and the Slurm and munge configurations are on the head node. Not sure how to integrate the

Re: [slurm-users] New Bright Cluster Slurm issue for AD users

2019-02-13 Thread Antony Cleave
Can you ssh to the compute node that the job was trying to run on as the AD user in question? I've seen similar issues on AD-integrated systems where some nodes boot from a different image that has not yet been joined to the domain. Antony On Wed, 13 Feb 2019 at 04:58, Yugendra Guvvala < yguvv.

Re: [slurm-users] New Bright Cluster Slurm issue for AD users

2019-02-13 Thread Yugendra Guvvala
Hi Loris, Thank you, this is strange. Is there a way to integrate AD authentication with SLURM or Munge? Or to allow all users who log in to run jobs without any restrictions? Thanks, Yugi > On Feb 13, 2019, at 1:50 AM, Loris Bennett wrote: > > Yugendra Guvvala writes: > >> Hi, >> >> We ar

Re: [slurm-users] New Bright Cluster Slurm issue for AD users

2019-02-12 Thread Loris Bennett
Yugendra Guvvala writes: > Hi, > > We are bringing a new cluster online. We installed SLURM through Bright > Cluster Manager; however, we are running into an issue here. > > We are able to run jobs as root user and users created using bright cluster > (cmsh commands). However, we use AD authent