I see assorted problems with OMPI 4.1 on IB, including failing many of the mpich tests (non-mpich-specific ones) particularly with RMA. Now I wonder if UCX build options could have anything to do with it, but I haven't found any relevant information.
What configure options would be recommended with CUDA and ConnectX-5 IB? (This is on POWER, but I presume that's irrelevant.) I assume they should be at least --enable-cma --enable-mt --with-cuda --with-gdrcopy --with-verbs --with-mlx5-dv but for a start I don't know what the relationship is between the cuda, shared memory, and multi-threading options in OMPI and UCX. Thanks for any enlightenment.