Hi
I don't know what version of Slurm you're using or how it may be different
from the one I'm using (18.05), but here's my understanding of memory
limits and what I'm seeing on our cluster. The parameter
`JobAcctGatherParams=OverMemoryKill` controls whether a step is killed if
it goes over the r
Hello,
I am in the situation where evaluating the precise memory consumption of jobs
beforehand is pretty challenging. So I would like to create a “trust” system,
meaning that the requested memory for jobs is taken into account for
scheduling, but no action is taken if the job actually breach t