On 9/8/2011 11:47 AM, Ghislain Lartigue wrote:
> I guess you're perfectly right!
> I will try to test it tomorrow by putting a call system("wait(X)") before
> the barrier!
What does "wait(X)" mean?

Anyhow, here is how I see your computation:

A)  The first barrier simply synchronizes the processes.
B)  Then you start a bunch of non-blocking, point-to-point messages.
C)  Then another barrier.
D)  Finally, the point-to-point messages are completed.

Your mental model might be that A, B, and C should be fast and that D should take a long time. The reality may be that the completion of all those messages is actually taking place during C.
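
In code, I picture something like the sketch below (C, with a simple left/right ring exchange of one large message standing in for whatever your real exchange does; the neighbor pattern and message size are my guesses, not your code):

/* Sketch of steps A-D above.  The ring exchange and the message size
 * are stand-ins for your real communication pattern. */
#include <mpi.h>

int main(int argc, char **argv)
{
    MPI_Init(&argc, &argv);

    int rank, size;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    enum { COUNT = 1 << 20 };            /* big enough to be a "long" message */
    static double sbuf[COUNT], rbuf[COUNT];
    int right = (rank + 1) % size;
    int left  = (rank + size - 1) % size;
    MPI_Request reqs[2];

    MPI_Barrier(MPI_COMM_WORLD);                               /* A */

    MPI_Irecv(rbuf, COUNT, MPI_DOUBLE, left,  0, MPI_COMM_WORLD, &reqs[0]);
    MPI_Isend(sbuf, COUNT, MPI_DOUBLE, right, 0, MPI_COMM_WORLD, &reqs[1]);  /* B */

    MPI_Barrier(MPI_COMM_WORLD);                               /* C */

    MPI_Waitall(2, reqs, MPI_STATUSES_IGNORE);                 /* D */

    MPI_Finalize();
    return 0;
}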

How about the following?

Barrier
t0 = MPI_Wtime()
start all non-blocking messages
t1 = MPI_Wtime()
Barrier
t2 = MPI_Wtime()
complete all messages
t3 = MPI_Wtime()
Barrier
t4 = MPI_Wtime()
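
For what it's worth, here is that skeleton made concrete, reusing the same ring-exchange stand-in as the sketch above; only the MPI_Wtime() bookkeeping and the final printf are new:

#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv)
{
    MPI_Init(&argc, &argv);

    int rank, size;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    enum { COUNT = 1 << 20 };
    static double sbuf[COUNT], rbuf[COUNT];
    int right = (rank + 1) % size;
    int left  = (rank + size - 1) % size;
    MPI_Request reqs[2];

    MPI_Barrier(MPI_COMM_WORLD);
    double t0 = MPI_Wtime();

    MPI_Irecv(rbuf, COUNT, MPI_DOUBLE, left,  0, MPI_COMM_WORLD, &reqs[0]);
    MPI_Isend(sbuf, COUNT, MPI_DOUBLE, right, 0, MPI_COMM_WORLD, &reqs[1]);
    double t1 = MPI_Wtime();

    MPI_Barrier(MPI_COMM_WORLD);     /* drop this barrier for the comparison run */
    double t2 = MPI_Wtime();

    MPI_Waitall(2, reqs, MPI_STATUSES_IGNORE);
    double t3 = MPI_Wtime();

    MPI_Barrier(MPI_COMM_WORLD);
    double t4 = MPI_Wtime();

    printf("rank %d: start(B) %.6f barrier(C) %.6f complete(D) %.6f final %.6f\n",
           rank, t1 - t0, t2 - t1, t3 - t2, t4 - t3);

    MPI_Finalize();
    return 0;
}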

Then, look at the data from all the processes graphically. Compare the picture to the same experiment, but with the middle barrier missing. Presumably, the full iteration will take roughly as long in both cases. The difference, I would expect, is that with the middle barrier present, the barrier absorbs nearly all the time and the message completion is fast; without the middle barrier, the message completion is slow. So, message completion is taking a long time either way, and the only difference is whether it takes place during your MPI_Test loop or during what you thought was only a barrier.
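
If collecting one printed line per process is awkward to post-process, one possibility (just a fragment meant to drop into the timed program above, reusing rank, size, and t0..t4 from there) is to gather the intervals onto rank 0 and write them out in one place:

/* Drop-in for the timed sketch above (also needs <stdlib.h> for
 * malloc/free).  Gathers each rank's four intervals onto rank 0 so a
 * single process can write one easy-to-plot table. */
double intervals[4] = { t1 - t0, t2 - t1, t3 - t2, t4 - t3 };
double *all = NULL;
if (rank == 0)
    all = malloc(4 * size * sizeof(double));
MPI_Gather(intervals, 4, MPI_DOUBLE, all, 4, MPI_DOUBLE, 0, MPI_COMM_WORLD);
if (rank == 0) {
    for (int r = 0; r < size; r++)
        printf("%d %.6f %.6f %.6f %.6f\n",
               r, all[4*r], all[4*r+1], all[4*r+2], all[4*r+3]);
    free(all);
}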

A simple way of doing all this is to run with a time-line profiler... some MPI performance analysis tool. You won't have to instrument the code, dump timings, or figure out graphics. Just look at pretty pictures! There is some description of tool candidates in the OMPI FAQ at http://www.open-mpi.org/faq/?category=perftools

> PS:
> if anyone has more information about the implementation of the MPI_IRECV()
> procedure, I would be glad to learn more about it!

I don't know how much detail you want here, but I suspect not much is warranted. There is a lot of complexity, but I think a few key ideas will help.

First, I'm pretty sure you're sending "long" messages. OMPI usually sends such messages by queueing up a request, and in the general case these requests can be "progressed" whenever an MPI call is made. So, whenever you make an MPI call, get away from the idea that you're doing only the one specific thing named by the call and its arguments. Think instead that the library will also look around to see what other outstanding MPI work it can progress.
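
So, for example, sprinkling MPI_Test calls through your computation gives the library regular opportunities to advance everything that is outstanding, not just the request you name. A hedged sketch of that pattern (compute_and_progress and do_some_work are made-up names for illustration, not OMPI API):

#include <mpi.h>

static void do_some_work(int chunk)      /* placeholder for your computation */
{
    (void)chunk;
}

/* Interleave computation with occasional MPI_Test calls.  Each call
 * nominally tests one request, but it also gives the library a chance
 * to progress every other outstanding request (e.g., the handshake and
 * data transfer of queued "long" messages). */
void compute_and_progress(MPI_Request *req, int nchunks)
{
    int done = 0;
    for (int i = 0; i < nchunks; i++) {
        do_some_work(i);
        if (!done)
            MPI_Test(req, &done, MPI_STATUS_IGNORE);
    }
    if (!done)
        MPI_Wait(req, MPI_STATUS_IGNORE);   /* finish whatever remains */
}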
