Re: [OMPI users] Tip for HPC cluster admins

2012-10-29 Thread John Hearns
Jeff, this is very good advice. I have had many, many hours of deep joy getting to know the OOM killer and all of his wily ways. Respect the OOM Killer! On the cluster I manage, the OOM killer is working; however, there is a strict policy that if the OOM killer kicks in on a cluster node, it is excluded f

[OMPI users] Tip for HPC cluster admins

2012-10-28 Thread Jeff Squyres
I know this is a little off-topic, but I thought I'd pass on some hard-won knowledge to HPC cluster administrators... Short version: -- You should probably either disable the Linux OOM killer on your cluster (even if you have swap disabled on your compute nodes), or configure it so