"Jeff Squyres (jsquyres)" <jsquy...@cisco.com> writes:

> Good question. I've filed
> https://github.com/open-mpi/ompi/issues/8379 so that we can track
> this.

For the benefit of the list: I was wrong to remember osc=ucx as general advice. The UCX docs just say you need to avoid the uct btl, which can cause memory corruption, but OMPI 4.1 still builds it and uses it by default. (The UCX docs also suggest other parameter changes, but those are for performance rather than correctness.)

Anyway, I can get at least IMB-RMA to run on this Summit-like hardware with just --mca btl ^uct, though other tests still fail in ways that seem to be specific to UCX on ppc64le rather than to OMPI.
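
In case it helps anyone trying the same thing, here is a rough sketch of two ways to exclude the uct btl. It relies on Open MPI's usual convention that an MCA parameter can also be set through an OMPI_MCA_<name> environment variable. The process count and the benchmark path in the commented launch lines are placeholders of mine, not what I actually ran.

```shell
# Exclude the uct BTL for every mpirun started from this shell session.
# This should be equivalent to passing "--mca btl ^uct" on the command line.
export OMPI_MCA_btl='^uct'

# Hypothetical launch line: the process count and the path to IMB-RMA
# are placeholders, so adjust them for your own installation.
# mpirun -np 2 ./IMB-RMA

# One-off form without the environment variable. Quoting the value
# avoids any special handling of ^ by the shell.
# mpirun --mca btl '^uct' -np 2 ./IMB-RMA
```

The environment variable is convenient in batch scripts where the mpirun line is generated elsewhere; the command-line form makes the setting visible in the job log.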