Hi Eric,
Yes, UCX is supposed to be stable for large problems.
Did you see the same crash with both OMPI-4.0.3 + UCX 1.8.0 and OMPI-4.1.2 +
UCX 1.11.2?
Have you also tried running the large problem tests with OMPI-5.0.x?
Regarding the application, at some point it invokes MPI_Alltoallv send
Hi,
In the past, we have successfully run large (finite element)
computations using PARMetis as the mesh partitioner.
We first succeeded in 2012 with OpenMPI (v2.?) and again in March 2019 with
OpenMPI 3.1.2.
Today, we have a bunch of nightly (small) tests running nic