Hi Dick,
Jeff paraphrased an unnamed source as suggesting that "any MPI
program that relies on a barrier for correctness is an incorrect MPI
application." That is probably too strong.
How about this assertion?
*If there are no wildcard receives, every MPI_Barrier call is
semantically irrelevant.*
Whether that holds depends on what 'semantically irrelevant' means. It is
clear that one can write a wildcard-free program that deadlocks because of
an incorrectly inserted barrier, and that removing the barrier avoids the
deadlock. (Imagine P1 doing Send; Barrier and P2 doing Barrier;
Receive(non-wildcard).)
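To make that concrete, here is a minimal C sketch (my own illustration, not
an example from our paper; run it with exactly two ranks). MPI_Ssend is used
so the send is guaranteed to be synchronous; with a plain MPI_Send the
deadlock would only be possible, depending on buffering:

#include <mpi.h>

int main(int argc, char **argv)      /* run with exactly 2 ranks */
{
    int rank, buf = 42;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    if (rank == 0) {
        /* "P1": blocks in the synchronous send until rank 1 posts its receive */
        MPI_Ssend(&buf, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
        MPI_Barrier(MPI_COMM_WORLD);
    } else if (rank == 1) {
        /* "P2": blocks in the barrier waiting for rank 0, which never gets there */
        MPI_Barrier(MPI_COMM_WORLD);
        MPI_Recv(&buf, 1, MPI_INT, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
    }

    /* Deadlock above; remove the barrier (or move it after the receive)
       and the program terminates normally. */
    MPI_Finalize();
    return 0;
}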
So a wildcard-free program may still deadlock (a semantically noticeable
effect) because of its barriers. I'm sure you did not mean to include this
degenerate nit-pick; otherwise, yes, you are right! A proof exists in a
paper by Siegel (cited in our EuroPVM/MPI 2008 paper) for a subset of MPI.
Our work takes that idea further and offers a complete checking algorithm
for one test harness (data set), as described below.
The exact criterion for locating semantically irrelevant barriers (we call
them Functionally Irrelevant Barriers, or FIB, in our paper) is given in
our EuroPVM/MPI 2008 paper. The analysis involves ordering paths built from
two relations, IntraCB and InterCB, where CB stands for Completes-Before.
What is IntraCB? Imagine two MPI sends from P1 to P2, in that order. MPI
forces them to complete in program order. Now imagine P1 sending to P2 and
then to P3. These can complete out of program order. Why? Because MPI
guarantees only point-to-point non-overtaking. It also makes sense
practically: the first send may be shipping a gigabyte to P2 while the
second ships a single byte to P3.
IntraCB is thus a relation weaker than program order. We define IntraCB
precisely in our CAV 2008 paper (available on our website). The basic idea
is simple: roughly 6-7 rules, like the ones illustrated above, capture
IntraCB.
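Here is a small C illustration of the two cases above (again my own sketch,
mapping P1, P2, P3 to ranks 0, 1, 2; run with at least three ranks):

#include <mpi.h>

int main(int argc, char **argv)      /* run with at least 3 ranks */
{
    int rank, a = 1, b = 2;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    if (rank == 0) {
        /* Case 1: same destination, same tag -- point-to-point
           non-overtaking forces these to match in program order,
           so there is an IntraCB edge from the first send to the second. */
        MPI_Send(&a, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
        MPI_Send(&b, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);

        /* Case 2: different destinations -- no completion order is implied;
           the (possibly huge) message to rank 1 may finish after the
           (tiny) message to rank 2. */
        MPI_Send(&a, 1, MPI_INT, 1, 1, MPI_COMM_WORLD);
        MPI_Send(&b, 1, MPI_INT, 2, 1, MPI_COMM_WORLD);
    } else if (rank == 1) {
        MPI_Recv(&a, 1, MPI_INT, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
        MPI_Recv(&b, 1, MPI_INT, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
        MPI_Recv(&a, 1, MPI_INT, 0, 1, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
    } else if (rank == 2) {
        MPI_Recv(&b, 1, MPI_INT, 0, 1, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
    }

    MPI_Finalize();
    return 0;
}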
In our EuroPVM/MPI 2008 paper, we then show how to lift IntraCB to InterCB
by computing a "closure" through barriers. This is what defines MPI
ordering paths, and it is again a simple idea.
The gist of FIB is this: if removing a barrier affects some ordering path,
then that barrier is functionally relevant; otherwise it is not. FIB
performs this analysis over all possible ordering paths.
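For intuition only (my example, not one from the paper): in the
wildcard-free program below, the receive already names its source, so
removing the barrier cannot change which send matches which receive; a
FIB-style analysis should flag this barrier as removable.

#include <mpi.h>

int main(int argc, char **argv)      /* any number of ranks >= 2 */
{
    int rank, buf = 7;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    if (rank == 0) {
        MPI_Send(&buf, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
    } else if (rank == 1) {
        MPI_Recv(&buf, 1, MPI_INT, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
    }

    /* This barrier adds no ordering that matters to the (deterministic)
       send/receive match above; it is functionally irrelevant. */
    MPI_Barrier(MPI_COMM_WORLD);

    MPI_Finalize();
    return 0;
}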
How are all the ordering paths determined? For this, FIB needs help from
our POE algorithm (CAV 2008), which generates the RELEVANT executions of an
MPI program. Basically, POE gives you a semantically minimal set of
interleavings of an MPI program (close to minimal; it can be slightly
bloated). Here is the idea: if you write an MPI program in which P1 sends
to P2, P3 sends to P2, and P2 posts a wildcard receive that can match
either sender, our POE algorithm generates two interleavings, one per
possible match. These are sufficient; there is no need to consider all
permutations of posting send1, send2, and receive(*). The POE algorithm is
essential for FIB to work.
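Here is a C rendering of roughly that situation (my sketch; I have given
the middle rank two wildcard receives so that both sends are consumed). The
only nondeterminism is which sender matches the first wildcard receive, so
two interleavings cover all relevant behaviors:

#include <mpi.h>

int main(int argc, char **argv)      /* run with at least 3 ranks */
{
    int rank, msg, got;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    if (rank == 0 || rank == 2) {
        /* two senders, both targeting rank 1 */
        msg = rank;
        MPI_Send(&msg, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
    } else if (rank == 1) {
        /* two wildcard receives; either sender may match the first one */
        MPI_Recv(&got, 1, MPI_INT, MPI_ANY_SOURCE, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
        MPI_Recv(&got, 1, MPI_INT, MPI_ANY_SOURCE, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
    }

    MPI_Finalize();
    return 0;
}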
In fact, with a few mouse clicks you can do all of this:
1) download ISP from
http://www.cs.utah.edu/formal_verification/ISP-release
2) fire it up
3) if running under Linux, use the --fib flag; under Windows the flag is
on by default
4) ISP verifies the program for assert failures, MPI object leaks, and
deadlocks
5) if ISP finishes without finding a deadlock (i.e., all goes well), it
prints the list of FIBs.
Please try it; we will appreciate it greatly! We may well have overlooked
something, and we will be very grateful for any feedback that helps improve
our ISP tool, which contains the FIB implementation.
As a bonus, if you read the first 3-4 pages of our EuroPVM/MPI 2008 paper,
you will find some "brain teasers": small examples where you can decide for
yourself whether the barriers could be removed. You can then type those
3-4 line examples into ISP and see what it says about their FIB status.
I'm not fibbing... :-)
Cheers,
Ganesh
p.s. I said that FIB does the analysis for one data set. As shown in our
paper, in many cases a static analyzer can determine that a program is
data independent. In that case, the FIB analysis holds for all inputs
(input = test harness = data set).
--
It is the exception that tests the rule.
If someone can provide an example of an MPI_Barrier that is required
by an application based on MPI communication, and that does not use
wildcard receive, I am interested in seeing it. I do not know of a
counterexample, but I also do not have a proof of the assertion I place
before the group.
No fair using examples with non-MPI interactions among tasks or with
job steering by asynchronous triggers from outside the job. I can
construct them myself.
MPI_WIN_FENCE is semantically required in some situations and examples
that show a semantic need for MPI_WIN_FENCE do not count against the
assertion.
I have appreciated the descriptions from Gus, Ashley, and others of
some non-semantic justifications for an MPI_Barrier.
Dick Treumann - MPI Team
IBM Systems & Technology Group
Dept X2ZA / MS P963 -- 2455 South Road -- Poughkeepsie, NY 12601
Tele (845) 433-7846 Fax (845) 433-8363