tsi...@coas.oregonstate.edu wrote:
Thanks for the explanation. I am using Gigabit Ethernet + Open MPI and
the buffered MPI_Bsend. I had already noticed that top behaved
differently on another cluster with InfiniBand + MPICH.
So the only way to find out how much time each process spends waiting
seems to be to profile the code. Will gprof show me anything useful,
or will I have to use a more sophisticated parallel profiler (are
there any free ones)?
Another frequently asked question! I can try to add a FAQ
entry/category. There are a number of free options, including:
TAU http://www.cs.uoregon.edu/research/tau/home.php
mpiP http://mpip.sourceforge.net/
FPMPI http://www.mcs.anl.gov/research/projects/fpmpi/WWW/index.html
IPM http://ipm-hpc.sourceforge.net/
Sun Studio http://developers.sun.com/sunstudio/
The only one I've really used is Sun Studio.
Jumpshot *might* work with Open MPI; I forget. It may be more of an
MPICH tool.
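
If you want a rough number before setting up one of these tools, you
can also time the blocking calls by hand. The sketch below is in
Python purely for illustration and is not taken from any of the tools
above. WaitTimer, timed, and fake_blocking_recv are made-up names, and
the sleep only stands in for a blocking receive. In a real C or
Fortran MPI code, the same pattern would bracket calls such as
MPI_Recv or MPI_Wait with MPI_Wtime().

```python
import time
from collections import defaultdict


class WaitTimer:
    """Accumulates wall-clock time spent inside wrapped blocking calls."""

    def __init__(self, clock=time.perf_counter):
        # The clock is injectable; an MPI code would use MPI_Wtime() instead.
        self.clock = clock
        self.totals = defaultdict(float)  # label -> seconds spent in calls
        self.counts = defaultdict(int)    # label -> number of calls

    def timed(self, label, func, *args, **kwargs):
        """Call func, charge the elapsed time to label, return func's result."""
        start = self.clock()
        try:
            return func(*args, **kwargs)
        finally:
            self.totals[label] += self.clock() - start
            self.counts[label] += 1

    def report(self):
        """Return {label: (call_count, total_seconds)}."""
        return {label: (self.counts[label], self.totals[label])
                for label in self.totals}


def fake_blocking_recv(delay):
    """Hypothetical stand-in for a blocking receive: it just sleeps."""
    time.sleep(delay)
    return "payload"


if __name__ == "__main__":
    timer = WaitTimer()
    for _ in range(3):
        timer.timed("recv", fake_blocking_recv, 0.01)
    for label, (calls, seconds) in timer.report().items():
        print(f"{label}: {calls} calls, {seconds:.3f} s waiting")
```

Each process would print its own totals, so comparing them across
ranks gives a first idea of which ones spend most of their time
waiting. This only measures time inside the calls you wrap; the
profilers listed above do that for every MPI call without code
changes.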