On Nov 25, 2008, at 10:29 AM, Максим Чусовлянов wrote:

Hello! How can I integrate my collective communication algorithm into Open MPI via the MCA?

Sorry for the delay in answering -- SC08 and the US holiday last week got in the way and I'm way behind on answering the mails in my INBOX.

Just to make sure we're talking about the same thing -- you have a new collective algorithm for one of the MPI collective functions, and you want to include that code in Open MPI so that it can be invoked by MPI_<foo> in MPI applications, right?

If so, the right way to do this is to build a new Open MPI "coll" (collective) component containing the code for your new algorithm. Our coll components are basically a few housekeeping functions plus a set of function pointers to the routines that serve as the back-ends to the MPI collective functions (i.e., MPI_Bcast and friends).

All the "coll" component code is under the ompi/mca/coll/ directory. The "base" directory is some "glue" code for the coll framework itself -- it's not a component. But all other directories are standalone components that have corresponding dynamic shared objects (DSOs) installed under $pkglibdir (typically $prefix/lib/openmpi).

You can build a component inside or outside of the Open MPI tree. If you build outside of the Open MPI tree, you need to configure OMPI with --with-devel-headers, which will install all of OMPI's internal headers under $prefix. That way, you can -I these headers when you compile your component. Just install your DSO in $pkglibdir; if all goes well, "ompi_info | grep coll" should show your component.

If you build inside of the Open MPI tree, you need to make your component dir under ompi/mca/coll/ and include a configure.params file (look at ompi/mca/coll/basic/configure.params for a simple example) and a Makefile.am (see ompi/mca/coll/basic/Makefile.am for an example). Then run the "autogen.sh" script that is at the top of the tree and then run configure. You should see your component listed in both the autogen.sh and configure output; configure should note that it plans to build that component. When you finish configure, build and install Open MPI. "ompi_info | grep coll" should show your component.

But I'm getting ahead of myself...  Let's go back a few steps...

When building inside the OMPI tree, if you need to check for various things to determine if you can build the component (i.e., some tests during configure, such as checking for various hardware support libraries), you can also add a configure.m4 file in your component's directory. This gets a little tricky if you're not familiar with Autoconf; let me know if you need some guidance here.
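As a concrete illustration, here is a hypothetical configure.m4 sketch for a component named "mycoll" that depends on an imaginary support library "libfoo". The MCA_<framework>_<component>_CONFIG macro name follows OMPI's convention, but check an existing component's configure.m4 in your tree for the exact macro signature and argument meanings before copying this:

```m4
# Hypothetical sketch -- "mycoll" and "libfoo" are made-up names.
# $1 = action if the component can be built; $2 = action if it cannot.
AC_DEFUN([MCA_coll_mycoll_CONFIG],[
    AC_CHECK_LIB([foo], [foo_init],
                 [$1],
                 [$2])
])
```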

Now you can add the source code to the component. We have 2 important abstractions that you need to know about:

- component: there is only one component instance in an MPI process. It has global state.
- module: in the coll framework, there is one module instance for every communicator that uses this component. It has local state relevant to that specific communicator.

Think of "component" as a C++ class, and "module" as a C++ object.

Now read the comments in ompi/mca/coll/coll.h. This file contains the struct interfaces for both the coll component and module. We basically do everything by function pointer; the component returns a set of function pointers and each module returns a struct of function pointers. These function pointers are invoked by libmpi at various times for various functions; see coll.h for a description of each.
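The component/module split boils down to the classic function-pointer-table pattern. Here's a self-contained C sketch of that pattern -- illustrative only; the real struct layouts, type names, and function signatures are in ompi/mca/coll/coll.h, so don't copy these names:

```c
#include <stdlib.h>

/* "module": per-communicator object; a table of collective entry points */
typedef struct module {
    int (*bcast)(void *buf, int count);  /* back-end for MPI_Bcast */
    int (*barrier)(void);                /* back-end for MPI_Barrier */
} module_t;

/* "component": one instance per process; manufactures a module
   for each communicator that selects it */
typedef struct component {
    const char *name;
    module_t *(*comm_query)(int comm_size);
} component_t;

static int my_bcast(void *buf, int count) { (void)buf; (void)count; return 0; }
static int my_barrier(void) { return 0; }

/* Called at communicator creation; returns the function-pointer table
   that libmpi will invoke for collectives on that communicator */
module_t *my_comm_query(int comm_size) {
    (void)comm_size;
    module_t *m = malloc(sizeof(*m));
    m->bcast = my_bcast;
    m->barrier = my_barrier;
    return m;
}

component_t my_component = { "example", my_comm_query };
```

In the class/object analogy below: my_component is the "class" (one per process, global state) and each malloc'ed module_t is an "object" (one per communicator, local state).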

During coll module initialization (i.e., when a new communicator has been created), there's a process called "selection" where OMPI determines which coll modules will be used on this communicator. Modules can include/exclude themselves from the selection process. For example, your algorithm may only be suitable for intracommunicators; so if the communicator being created is an intercommunicator, you probably want to exclude your module from selection. Or if your algorithm can only handle a power-of-two number of MPI processes, it should exclude itself if there is a non-power-of-two number of processes in the communicator. And so on.
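The exclusion logic itself is usually just a few cheap checks at query time. Here's a standalone C sketch of the decision described above, using a made-up comm_info_t stand-in for the communicator (the real hook is the module query function declared in ompi/mca/coll/coll.h, which returns a module pointer or NULL):

```c
#include <stddef.h>

/* Stand-in for the communicator attributes the query would inspect */
typedef struct comm_info {
    int size;      /* number of processes in the communicator */
    int is_inter;  /* nonzero for an intercommunicator */
} comm_info_t;

static int is_power_of_two(int n) {
    return n > 0 && (n & (n - 1)) == 0;
}

/* Return nonzero to stay in the selection process, zero to exclude
   ourselves for this communicator */
int my_coll_wants_comm(const comm_info_t *comm) {
    if (comm->is_inter) {
        return 0;  /* our algorithm only handles intracommunicators */
    }
    if (!is_power_of_two(comm->size)) {
        return 0;  /* our algorithm needs a power-of-two process count */
    }
    return 1;
}
```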

We designed coll modules in OMPI v1.3 to be "mix-n-match"-able such that in a single communicator, you can use the broadcast function from one module, but the gather function from a different module. Hence, multiple coll modules may be active on a single communicator. In your case, you'll need to make sure that your function has a higher priority than the "tuned" coll component (which is the default in many cases).
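To make the mix-n-match idea concrete, here's a self-contained C sketch of priority-driven per-function selection: for each collective slot, the highest-priority candidate that offers that function wins. This is not OMPI's actual selection code, and the names and priority values are invented (in real OMPI the priority typically comes from an MCA parameter such as coll_<name>_priority):

```c
#include <stddef.h>

/* A candidate coll module with its priority; a NULL function pointer
   means "this module does not offer that collective" */
typedef struct candidate {
    const char *name;
    int priority;
    int (*bcast)(void);
    int (*gather)(void);
} candidate_t;

/* Highest-priority candidate that provides bcast wins that slot */
const candidate_t *select_bcast(const candidate_t *c, int n) {
    const candidate_t *best = NULL;
    for (int i = 0; i < n; i++)
        if (c[i].bcast && (best == NULL || c[i].priority > best->priority))
            best = &c[i];
    return best;
}

/* Same rule, independently, for the gather slot */
const candidate_t *select_gather(const candidate_t *c, int n) {
    const candidate_t *best = NULL;
    for (int i = 0; i < n; i++)
        if (c[i].gather && (best == NULL || c[i].priority > best->priority))
            best = &c[i];
    return best;
}

int dummy_bcast(void) { return 0; }
int dummy_gather(void) { return 0; }

/* Invented example: "tuned" offers both collectives at priority 30;
   "mycoll" offers only bcast, at a higher priority of 80 */
candidate_t example_modules[2] = {
    { "tuned",  30, dummy_bcast, dummy_gather },
    { "mycoll", 80, dummy_bcast, NULL },
};
```

With this setup, bcast on the communicator would come from "mycoll" while gather falls back to "tuned" -- which is exactly the per-function mixing described above.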

I'd suggest working in the Open MPI v1.3 tree, as we're going to release this version soon and all future work is being done here (vs. the v1.2 tree, which will eventually be deprecated).

Hopefully this is enough information to get you going. Please feel free to ask more questions! But you might want to post followup questions to the devel list; these aren't really user-level questions.

Good luck!

--
Jeff Squyres
Cisco Systems

