Paul Kapinos <kapi...@itc.rwth-aachen.de> writes:

> Nathan,
> unfortunately '--mca memory_linux_disable 1' does not help on this
> issue - it does not change the behaviour at all.
> Note that the pathological behaviour is present in Open MPI 2.0.2 as
> well as in 1.10.x, and Intel OmniPath (OPA) network-capable nodes are
> affected only.

[I guess that should have been "too" rather than "only".  It's loading
the openib btl that is the problem.]

> The known workaround is to disable InfiniBand fallback by '--mca btl
> ^tcp,openib' on nodes with OPA network. (On IB nodes, the same tweak
> led to a 5% performance improvement on single-node jobs;

It was a lot more than that in my cp2k test.

> but obviously
> disabling IB on nodes connected via IB is not a solution for
> multi-node jobs, huh).

But it works OK with libfabric (ofi mtl).  Is there a problem with
libfabric?
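
For concreteness, here is a rough sketch of the two invocations under
discussion. The process count and ./app are placeholders, not anything
from this thread, and the '--mca pml cm --mca mtl ofi' selection is my
assumption of how one would request the ofi mtl explicitly; check
ompi_info on your build to see which components are actually present.

```shell
# Placeholders for illustration only (not from the thread).
APP=./app
NP=4

# Workaround quoted above: exclude the tcp and openib BTLs.
BTL_WORKAROUND="--mca btl ^tcp,openib"

# Assumption: select the ofi mtl explicitly through the cm pml.
OFI_SELECT="--mca pml cm --mca mtl ofi"

# Print the command lines rather than running them.
echo "mpirun -np $NP $BTL_WORKAROUND $APP"
echo "mpirun -np $NP $OFI_SELECT $APP"
```

The commands are only echoed so they can be compared side by side
before being run on an OPA or IB node.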

Has anyone reported this issue to the cp2k people?  I know it's not
their problem, but I assume they'd like to know for users' sake,
particularly if it's not going to be addressed.  I wonder what else
might be affected.
_______________________________________________
users mailing list
users@lists.open-mpi.org
https://rfd.newmexicoconsortium.org/mailman/listinfo/users
