[OMPI users] using specific algorithm for collective communication, and knowing the root cpu?

George Markomanolis Mon, 2 Nov 2009 08:53:07 -0500

Dear all,

I would like to ask about collective communication. With debug mode enabled, I can see a lot of information during execution, such as which algorithm is used. My question, however, is how to use a specific algorithm (the simplest one, I suppose). I am profiling some applications and want to simulate them with another program, so I must know, for example, what mpi_allreduce is doing. I saw that there are many algorithms, chosen depending on the message size and the number of processors, so I would like to ask:

1) How can I tell Open MPI to use a simple algorithm for allreduce? (Is there a way to request the simplest algorithm for all collective communication?) Basically, I would like to know the root CPU for every collective communication. What are the disadvantages of requiring the simplest algorithm?

2) Is there any overhead because I installed Open MPI in debug mode, even if I run a program without any --mca flags?

3) How would you describe allreduce in words? Can we say that the root CPU does a reduce and then a broadcast? I mean, is that right for your implementation? I saw that which CPU is the root depends on the algorithm, so is it possible to use an algorithm for which I know that the CPU with rank 0 is always the root?
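
To make question 3 concrete, here is a minimal sketch of the "reduce to rank 0, then broadcast" model I would like to be able to assume in the simulator. It is plain Python with no MPI calls, and it is only an illustration of the model, not a description of what Open MPI actually does. The function name allreduce_linear and the (source, destination) message log are hypothetical names chosen for this example.

```python
from operator import add


def allreduce_linear(values, op=add, root=0):
    """Model allreduce as a linear reduce to `root` plus a linear broadcast.

    values[i] is the contribution of rank i. Returns (results, messages):
    results[i] is what rank i holds afterwards, and messages is the ordered
    list of (source_rank, destination_rank) point-to-point transfers.
    This is an assumed model for simulation, not Open MPI's implementation.
    """
    n = len(values)
    messages = []

    # Reduce phase: every non-root rank sends its value to the root,
    # and the contributions are combined in rank order.
    acc = None
    for rank in range(n):
        if rank != root:
            messages.append((rank, root))
        acc = values[rank] if acc is None else op(acc, values[rank])

    # Broadcast phase: the root sends the combined value to every other rank.
    results = [None] * n
    for rank in range(n):
        if rank != root:
            messages.append((root, rank))
        results[rank] = acc

    return results, messages


results, messages = allreduce_linear([1, 2, 3, 4])
print(results)   # every rank ends up with the same combined value
print(messages)  # transfers into rank 0 first, then out of rank 0
```

In this model rank 0 is always the root, and with N ranks there are 2*(N-1) point-to-point messages, which is the kind of fixed pattern I would like to replay in the simulation.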


Thanks a lot,
George
