Thanks for the quick reply. I ran my tests with a hostfile containing

cedar.reachone.com slots=4

I clearly misunderstood the role of the 'slots' parameter, because when I
removed it, Open MPI slightly outperformed LAM, which I assume it should.
Thanks for the help.
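For the archives, the fix amounted to dropping the slot count from the
hostfile; the mpirun invocation below is only illustrative (the hostfile
and program names are placeholders):

  # before: claims 4 slots, so Open MPI busy-polls when oversubscribed
  cedar.reachone.com slots=4

  # after: Open MPI treats the node as oversubscribed and gives up the CPU
  cedar.reachone.com

  mpirun --hostfile myhosts -np 4 ./transpose_test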
Tom
Brian Barrett wrote:
On Jan 4, 2006, at 4:24 PM, Tom Rosmond wrote:
I have been using LAM-MPI for many years on PC/Linux systems and have been
quite pleased with its performance. However, at the urging of the LAM-MPI
website, I have decided to switch to Open MPI. For much of my preliminary
testing I work on a single-processor workstation (see the attached
config.log and ompi_info.log files for some of the specifics of my system).
I frequently run with more than one virtual MPI processor (i.e. oversubscribe
the real processor) to test my code. With LAM the runtime penalty for this is
usually insignificant for 2-4 virtual processors, but with Open MPI it has
been prohibitive. Below is a matrix of runtimes for a simple MPI matrix
transpose code using mpi_sendrecv (I tried other variations of
blocking/non-blocking, synchronous/non-synchronous send/recv with similar
results).
message size = 262144 bytes

             LAM            Open MPI
1 proc:    .02575 secs    .02513 secs
2 procs:   .04603 secs    10.069 secs
4 procs:   .04903 secs    35.422 secs
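For context, the heart of the test is just a timed MPI_Sendrecv exchange of
262144-byte messages between pairs of ranks. This is not my actual transpose
code, only a rough sketch of the pattern being timed:

  /* Rough sketch only -- times repeated MPI_Sendrecv exchanges of
   * 262144-byte messages between pairs of ranks.  Not the actual
   * transpose code. */
  #include <mpi.h>
  #include <stdio.h>
  #include <stdlib.h>

  #define MSG_BYTES 262144
  #define REPS      100

  int main(int argc, char **argv)
  {
      int rank, size, i, partner;
      double t0, t1;
      char *sendbuf, *recvbuf;

      MPI_Init(&argc, &argv);
      MPI_Comm_rank(MPI_COMM_WORLD, &rank);
      MPI_Comm_size(MPI_COMM_WORLD, &size);

      sendbuf = malloc(MSG_BYTES);
      recvbuf = malloc(MSG_BYTES);

      partner = rank ^ 1;                  /* exchange with neighboring rank */
      if (partner >= size) partner = rank; /* odd rank count: send to self */

      MPI_Barrier(MPI_COMM_WORLD);
      t0 = MPI_Wtime();
      for (i = 0; i < REPS; i++) {
          MPI_Sendrecv(sendbuf, MSG_BYTES, MPI_BYTE, partner, 0,
                       recvbuf, MSG_BYTES, MPI_BYTE, partner, 0,
                       MPI_COMM_WORLD, MPI_STATUS_IGNORE);
      }
      t1 = MPI_Wtime();

      if (rank == 0)
          printf("%d procs: %g secs per exchange\n", size, (t1 - t0) / REPS);

      free(sendbuf);
      free(recvbuf);
      MPI_Finalize();
      return 0;
  }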
I am pretty sure that LAM exploits the fact that the virtual processors are
all sharing the same memory, so communication is via memory and/or the PCI
bus of the system, while my Open MPI configuration doesn't exploit this. Is
this a reasonable diagnosis of the dramatic difference in performance? More
importantly, how do I reconfigure Open MPI to match the LAM performance?
Based on the output of ompi_info, you should be using shared memory
with Open MPI (as you are with LAM/MPI). What RPI are you using with
LAM/MPI (just so we have some idea what you are comparing to)? And
how are you running Open MPI (what command are you passing to mpirun,
and if you include a hostfile, what is in that hostfile)?
If you tell Open MPI via a hostfile that a machine has 2 CPUs when it only
has 1 and then try to run 2 processes on it, you will run into severe
performance issues. In that case, Open MPI polls aggressively, never giving
up the CPU even when there is nothing to do. If Open MPI is told that there
is only 1 CPU and you run 2 processes of the same job on that node, it will
be much better about giving up the CPU. That is where I would start looking.
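If you do need to keep the slot counts in the hostfile, I believe (check
ompi_info for the exact name in your build) you can also force the polite,
yielding behavior explicitly with the mpi_yield_when_idle MCA parameter,
e.g.:

  mpirun --mca mpi_yield_when_idle 1 -np 4 ./transpose_test

where ./transpose_test is just a placeholder for your own test program.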
If you have some test code you could share, I'd love to see it - it
would help in duplicating your results and finding a solution...
Brian