Re: [slurm-users] RES: Change something in user's script using job_submit.lua plugin

2023-10-27 Thread Ole Holm Nielsen
Hi Paulo, Maybe what you see is due to a bug then? You might try to update Slurm to see if has been fixed. You should not use the Slurm RPMs from EPEL - I think offering these RPMs was a mistake. Anyway you ought to upgrade to the latest Slurm 23.02.6 since a serious security issue was fi

Re: [slurm-users] slurm cluster error - bad node index

2023-10-27 Thread Patrick Goetz
Hi - Very delayed response to this, as I'm working my way through a backlog of slurm-user posts. If this error is intermittent, it's likely a hardware issue. Recently I ran into an problem where a host with 8 GPUs was spontaneously rebooting a couple of minutes after a user would start an 8

[slurm-users] RES: Change something in user's script using job_submit.lua plugin

2023-10-27 Thread Paulo Jose Braga Estrela
Hi Ole, Yes, the script is running and changing other fields like comment, partition, account is working fine. The only problem seems to be the script field of job_rec. I'm using Slurm 20.11.9 from EPEL repository for RHEL 8. Thank you for sharing your Wiki. I've accessed it before. It's really