Re: [slurm-users] salloc problem

2022-10-30 Thread Chris Samuel
On 27/10/22 4:18 am, Gizo Nanava wrote: we run into another issue when using salloc interactively on a cluster where Slurm power saving is enabled. The problem seems to be caused by the job_container plugin and occurs when the job starts on a node which boots from a power down state. If I resu

Re: [slurm-users] Switch setting in slurm.conf breaks slurmctld if the switch type is not there in slurmcrld node

2022-10-30 Thread Chris Samuel
On 27/10/22 11:30 pm, Richard Chang wrote: Yes, the system is a HPE Cray EX, and I am trying to use switch/hpe_slingshot. Which version of Slurm are you using Richard? All the best, Chris -- Chris Samuel : http://www.csamuel.org/ : Berkeley, CA, USA

Re: [slurm-users] Prolog and job_submit

2022-10-30 Thread Chris Samuel
On 30/10/22 12:27 pm, Davide DelVento wrote: But if I understand correctly your Prolog vs TaskProlog distinction, the latter would have the environmental variable and run as user, whereas the former runs as root and doesn't get the environment, That's correct. My personal view is that injectin

[slurm-users] What happens if slurmdbd loses connection to mysql

2022-10-30 Thread Richard Chang
Hi, I have two dedicated nodes for slurm, node1 and node2. I have created the following. *Role* *SlurmCTLD* *SlurmDBD* *Mariadb Server for accounting storage* *Primary* Node1 Node2 Node2 *Backup* Node2 Nod

Re: [slurm-users] Prolog and job_submit

2022-10-30 Thread Davide DelVento
Hi Chris, > Unfortunately it looks like the license request information doesn't get > propagated into any prologs from what I see from a scan of the > documentation. :-( Thanks. If I am reading you right, I did notice the same thing and in fact that's why I wrote that job_submit lua script which

Re: [slurm-users] Prolog and job_submit

2022-10-30 Thread Chris Samuel
On 30/10/22 10:23 am, Chris Samuel wrote: Unfortunately it looks like the license request information doesn't get propagated into any prologs from what I see from a scan of the documentation. 🙁 This _may_ be fixed in the next major Slurm release (February) if I'm reading this right: https:

Re: [slurm-users] Prolog and job_submit

2022-10-30 Thread Chris Samuel
On 29/10/22 7:37 am, Davide DelVento wrote: So either I misinterpreted that "same environment as the user tasks" or there is something else that I am doing wrong. Slurm has a number of different prologs that can run which can cause confusion, and I suspect that's what's happening here. The