Jeff, this is very good advice.
I have had many, many hours of deep joy getting to know the OOM killer
and all of his wily ways.
Respect the OOM Killer!
On cluster I manage, the OOM killer is working, however there is a
strict policy that if OOM killer kicks on in a cluster node it is
excluded f
I know this is a little off-topic, but I thought I'd pass on some hard-won
knowledge to HPC cluster administrators...
Short version:
--
You should probably either disable the Linux OOM killer on your cluster (even
if you have swap disabled on your compute nodes), or configure it so