Re: [OMPI users] Collective operations and synchronization

George Bosilca Mon, 23 Mar 2009 17:41:29 -0400

Unfortunately even the MPI_Barrier doesn't guarantee a synchronousexit on all processes. There is no such thing in the MPI and there isno way to implement such a synchronization primitive in general (ifone take in account metrics such as performance or scalability).

In this particular context the MPI_Allreduce offers you exactly thesame kind of synchronization as the MPI_Barrier. Moreover, all nonrooted collectives (with the exception of the v versions) imply asynchronous behavior simply because all processes involved in thecollective have to participate with some data.


  george.

On Mar 23, 2009, at 17:11 , Ralph Castain wrote:

Just one point to emphasize - Eugene said it, but many times peopledon't fully grasp the implication.
On an MPI_Allreduce, the algorithm requires that all processes -enter- the call before anyone can exit.
It does -not- require that they all exit at the same time.
So if you want to synchronize on the -exit-, as your questionindicated, then you must add the MPI_Barrier as you describe.
Ralph


On Mar 23, 2009, at 3:01 PM, Eugene Loh wrote:
Shaun Jackman wrote:
I've just read in the Open MPI documentation [1]
That's the MPI spec, actually.
that collective operations, such as MPI_Allreduce, maysynchronize, but do not necessarily synchronize. My algorithmrequires a collective operation and synchronization; is there abetter (more efficient?) method than simply calling MPI_Allreducefollowed by MPI_Barrier?
MPI_Allreduce is a case that actually "requires" synchronization inthat no participating process may exit before all processes haveentered. So, there should be no need to add additionalsynchronization. A special case might be an MPI_Allreduce of a 0-length message, in which case I suppose an MPI implementation couldsimple "do nothing", and the synchronization side-effect would belost.
The MPI spec is mainly talking about a "typical" collective whereone could imagine a process exiting before some processes haveentered. E.g., in a broadcast or scatter, the root could exitbefore any other process has entered the operation. In a reduce orgather, the root could enter after all other processes haveexited. For all-to-all, allreduce, or allgather, however, noprocess can exit before all processes have entered, which is thesynchronization condition effected by a barrier. (Again, nullmessage lengths can change things.)
[1] http://www.mpi-forum.org/docs/mpi21-report-bw/node85.htm
_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users
_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users
_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users

Re: [OMPI users] Collective operations and synchronization

Reply via email to