Hi Folks,
As the number of cores per socket keeps increasing, choosing the right pml/btl combination (ucx, ob1, uct, vader, etc.) for the best intra-node performance becomes important. For openmpi-4.1.4, which pml/btl combination is best for intra-node communication at high core counts (point-to-point as well as collectives), and why? Does the answer to this question also hold for the upcoming ompi5 release?
--Arun