I know this is a little off-topic, but I thought I'd pass on some hard-won
knowledge to HPC cluster administrators...
Short version:
--
You should probably either disable the Linux OOM killer on your cluster (even
if you have swap disabled on your compute nodes), or configure it so
Hi,
I simplified my test program, as shown below.
I again compared openmpi-1.6.2 and openmpi-1.7rc1/4
in terms of system CPU usage while some processes wait for
others.
The result is the same as reported before.
system cpu usage
openmpi-1.6.2 0%
openmpi-1.7rc1 70%
openmpi-1.7