Thanks Jeff for your answer.

It is sad that the approach I mentioned, having all of a user process's memory 
registered on the cluster nodes, didn't become more popular.

I still believe that such an approach would shorten the executed code path in 
MPI libraries, reduce message latency, increase the communication/computation 
overlap potential and allow communication to progress more naturally.

But since we now have to live with memory registration issues, what changes 
should be made to a standard Linux distro so that Open MPI can make the best 
use of a recent Mellanox InfiniBand network?

I guess that installing the ummunotify kernel module is a good idea?

Maybe removing the limit on "max locked memory" (ulimit -l) is also a good 
idea?
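
For reference, here is a minimal sketch of how the current limit could be 
checked on a node from Python, using the standard resource module. The helper 
names format_limit and memlock_limits are only illustrative, and the sketch 
assumes that "ulimit -l" reports the limit in units of 1024 bytes, as bash 
does on Linux.

```python
import resource


def format_limit(value):
    """Render an rlimit value (in bytes) in the KiB units used by `ulimit -l`."""
    if value == resource.RLIM_INFINITY:
        return "unlimited"
    return f"{value // 1024} KiB"


def memlock_limits():
    """Return the (soft, hard) RLIMIT_MEMLOCK values of this process, in bytes."""
    return resource.getrlimit(resource.RLIMIT_MEMLOCK)


if __name__ == "__main__":
    soft, hard = memlock_limits()
    print("max locked memory: soft =", format_limit(soft),
          "hard =", format_limit(hard))
```

Run inside a job launched the same way as the MPI processes, this would show 
the limit those processes actually inherit, which may differ from the one 
seen in an interactive login shell.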

Besides that, I guess that installing the latest OFED (to have the latest 
middleware) instead of using the default one that comes with the Linux distro 
is a good idea?

Also, is the XPMEM kernel module, for more efficient intra-node transfer of 
large messages, still worth installing now that kernels include the CMA API?

Thanks,

Martin Audet
