If you are running fewer processes on your nodes than they have
processors, then you can improve performance by adding
-mca mpi_paffinity_alone 1
to your command line. This will bind your processes to individual cores,
which helps with latency. If your program involves collectives, then
you can try setting
-mca coll_hierarch_priority 100
This will activate the hierarchical collectives, which use shared
memory for messages between processes on the same node.
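Putting both suggestions together, the resulting invocation might look like the following sketch; the process count, hostfile name, and program name are placeholders, not values from this thread:

```shell
# Bind each MPI process to its own core, and raise the priority of the
# hierarchical collective component so it is selected at runtime.
mpirun -np 8 --hostfile my_hosts \
    -mca mpi_paffinity_alone 1 \
    -mca coll_hierarch_priority 100 \
    ./my_program
```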
Ralph
On Jun 26, 2009, at 9:09 PM, Qiming He wrote:
Hi all,
I am new to Open MPI and have an urgent run-time question. I have
openmpi-1.3.2 compiled with the Intel Fortran compiler v11 simply by
./configure --prefix=<my-dir> F77=ifort FC=ifort
then I set my LD_LIBRARY_PATH to include <openmpi-lib> and <intel-lib>,
and compile my Fortran program. There are no compilation errors.
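For reference, the environment setup described here might look like the sketch below; the install paths are illustrative assumptions, not the poster's actual locations:

```shell
# Hypothetical install prefixes -- substitute your own <openmpi-lib>
# and <intel-lib> locations.
export PATH=$HOME/openmpi-1.3.2/bin:$PATH
export LD_LIBRARY_PATH=$HOME/openmpi-1.3.2/lib:/opt/intel/lib/intel64:$LD_LIBRARY_PATH

# Compile with the Open MPI Fortran wrapper, which invokes ifort here.
mpif90 -O2 -o my_program my_program.f90
```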
I run my program on a single node, and everything looks OK. However, when
I run it on multiple nodes with
mpirun -np <num> --hostfile <my-hosts> <my-program>
the performance is much worse than on a single node with the same
problem size (MPICH2 shows a 50% improvement on the same setup).
Using top and saidar, I find that user time (CPU user) is much lower
than system time (CPU system), i.e.,
only a small portion of the CPU time is used by the user application,
while the rest is spent in the system.
No wonder I get bad performance. I assume the "CPU system" time is spent
on MPI communication.
I notice the total traffic (on eth0) is not that big (~5 Mb/sec).
What is the system CPU busy with?
Can anyone help? Anything I need to tune?
Thanks in advance
-Qiming
_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users