Re: [OMPI users] Question about RDMA

Jeff Squyres Tue, 17 Jun 2008 12:28:50 -0400

On Jun 6, 2008, at 6:03 AM, Gabriele Fatigati wrote:

Hi Jeff,


Sorry for the delay in replying -- I was on vacation all last week.

thanks for you reply. I did understand previous questions aboutRDMA. Ever with SKaMPI, i tried to run with mpi_leave_pinned = 1, asyou have suggested. But also in this case, execution time is verysimilar to previous case.
Does it means that SKaMPI, reallocates buffer every time ? Forexample, with "MPI_Bcast-length" test, over 128 procs, thecollective is repeated about 28 times, increasing buffer size foreach step by internal formula, and finale buffer size =2097152 K.

It could be that SKaMPI does re-alloc its buffers for every call -- Ihave not looked at the internals of SKaMPI in quite a long time.

It could also be that OMPI is not using the mpi_leave_pinned support.Are you building OMPI with the memory manager? OMPI needs that memorymanager (ptmalloc2, in the case of Linux) to be able to properlyeffect mpi_leave_pinned support. You should be able to run ompi_info| grep malloc and see something like this:

MCA memory: ptmalloc2 (MCA v1.0, API v1.0, Componentv1.3)

If that line doesn't show, then OMPI was not built with the memorymanager support, and mpi_leave_pinned will have no effect.

Since there aren't advantages with leave_pinned = 1, it means thatSKaMPI doesn't allocates buffer of 2097152 K initially, but itallocates small buffer and reallocates buffer every time, with morelarge size. Is it possible? If no, which is the cause of similarperformance?

It *could* mean that SKaMPI doesn't re-use the same large buffer forsubsequent MPI operations. An examination of SKaMPI's code shouldpretty easily be able to tell if this is the case.

It could also be that OMPI is using internal bufferers for a pipelinedbroadcast -- I'll have to check with George on that.

Another question: RDMA pipeline protocol for long messages, inOpenMPI 1.2.6 is setting by default?

I can't quite parse that question. OMPI v1.2.6 uses the pipelinedprotocol for long messages by default. It uses a slightly differentprotocol when mpi_leave_pinned is active. Both of these should bedescribed on the OMPI FAQ.


--
Jeff Squyres
Cisco Systems

Re: [OMPI users] Question about RDMA

Reply via email to