On Jun 26, 2020, at 7:30 AM, Peter Kjellström via users <users@lists.open-mpi.org> wrote: > >> The cluster hardware is QLogic infiniband with Intel CPUs. My >> understanding is that we should be using the old PSM for networking. >> >> Any thoughts what might be going wrong with the build? > > Yes only PSM will perform well on that hardware. Make sure that PSM > works on the system. Then make sure you got a mca_mtl_psm built.
I think Peter is right: you want to use mpirun --mca pml cm --mca mtl psm ... I *think* Intel InfiniPath is PSM and Intel OmniPath is PSM2, so "psm" is what you want (not "psm2"). Don't try to use pml/ob1 + btl/openib, and don't try to use UCX. PSM is Intel's native support for its Infinipath network. -- Jeff Squyres jsquy...@cisco.com