[slurm-users] Re: srun launched mpi job occasionally core dumps

2024-05-02 Thread Henderson, Brent via slurm-users
Re-tested with slurm 23.02.7 (had to also disable slurmdbd and run the controller with the '-i' option) but still reproduced the issue fairly quickly. Feels like the issue might be some interaction with RHEL 9.3 cgroups and slurm. Not sure what to try next - hoping for some suggestions. Thank

[slurm-users] Re: Partition Preemption Configuration Question

2024-05-02 Thread Davide DelVento via slurm-users
Hi Jason, I wanted exactly the same and was confused exactly like you. For a while it did not work, regardless of what I tried, but eventually (with some help) I figured it out. What I set up and it is working fine is this globally PreemptType = preempt/partition_prio PreemptMode=REQUEUE and th

[slurm-users] Partition Preemption Configuration Question

2024-05-02 Thread Jason Simms via slurm-users
Hello all, The Slurm docs have me a bit confused... I'm wanting to enable job preemption on certain partitions but not others. I *presume* I would set PreemptType=preempt/partition_prio globally, but then on the partitions where I don't want jobs to be able to be preempted, I would set PreemptMode