Re: [OMPI users] Any scientific application heavily using MPI_Barrier?

Eugene Loh Mon, 24 Aug 2009 16:22:45 -0400

Jeff Squyres wrote:

On Aug 24, 2009, at 1:03 PM, Eugene Loh wrote:
E.g., let's say P0 and P1 each send a message to P2, both using the same tag and communicator. Let's say P2 does two receives on that communicator and tag, using a wildcard source. So, the messages could be received in either order. One could introduce barriers to order the messages. E.g.,
P0:
  Send
  Barrier
P1:
  Barrier
  Send
P2:
  Recv
  Barrier
  Recv
Is this behavior *guaranteed* by MPI? I'm not actually sure that it is; barrier does not provide any guarantees about point-to-point message passing progress.
For example, how about a machine with these assumptions:

- P0 is "far away" from P2 on the point-to-point network
- P1 is "close by" to P2 on the point-to-point network
- Barriers go across a separate/fast network (think: bluegene)
- P0's send message is short/eager
In this case, the Send from P0 completes "immediately" and P0 enters the barrier before the message is delivered to P2. The P0 send could then take a "long time" to get to P2 --


Okay, so let's say P0 completes its send and enters the barrier.

Also, P1 enters the barrier. But it will not issue a send until it leaves the barrier, which requires that the last process has entered the barrier.

Meanwhile, the last process, P2, is waiting on a receive before it enters the barrier.

So, here's the situation. P2 is waiting to receive a message, a message has been sent to P2, and no other message will be sent to P2 until some message has been received. So, there are only two options:


1) The first receive on P2 receives the message from P0.  Or,

2) This perfectly legal MPI program deadlocks.

Right?

potentially long enough for the barrier to overtake it

No. The first Recv on P2 has to complete before P2 can enter the barrier, which is a prerequisite for the barrier to complete on any process.

and for the Send from P1 to be delivered to P2 before the Send from P0 arrives at P2.
Couldn't that happen?

No. The send on P1 cannot be issued before the barrier completes on P1, which cannot happen before the barrier is entered on P2, which cannot happen before the first Recv on P2 is completed, which cannot happen until some message is received on P2. And, the only message that can be received on P2 is the one issued by P0.
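
To make that chain of dependencies concrete, here is a small sketch that models the three ranks as Python threads. It is not MPI and proves nothing about what the standard guarantees; it only illustrates the argument under stated assumptions. `threading.Barrier` stands in for the barrier, a `queue.Queue` stands in for P2's wildcard-source receives, and the hypothetical `p0_delivery_delay` parameter models a short/eager send from a "far away" P0 that completes locally long before the message arrives. The model assumes that no rank leaves the barrier until every rank has entered it.

```python
import queue
import threading


def run_once(p0_delivery_delay=0.05):
    """Run the Send/Barrier/Recv pattern once; return sources in receive order."""
    mailbox = queue.Queue()         # stand-in for P2's receives with a wildcard source
    barrier = threading.Barrier(3)  # stand-in for a barrier over P0, P1, P2
    received = []

    def p0():
        # "Eager" send: returns at once, but the message only reaches
        # P2's mailbox after the (hypothetical) delivery delay.
        threading.Timer(p0_delivery_delay, mailbox.put, args=("P0",)).start()
        barrier.wait()

    def p1():
        barrier.wait()      # cannot return until P2 has entered the barrier
        mailbox.put("P1")   # "close by": delivered immediately

    def p2():
        received.append(mailbox.get())  # first Recv blocks until a message arrives
        barrier.wait()
        received.append(mailbox.get())  # second Recv

    threads = [threading.Thread(target=f) for f in (p0, p1, p2)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return received


if __name__ == "__main__":
    print(run_once())
```

In this model, however large the delivery delay is made, P2 cannot enter the barrier until P0's message has arrived, so P1's message cannot even be sent before then; the only alternative to the P0-then-P1 order is that the first receive never completes.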

Granted, I would expect that your example would perform in most real-world situations as you describe (P0 is delivered to P2, then P1 is delivered to P2). But I don't think the standard guarantees it.
