Re: [OMPI users] Calling MPI_Test() too many times results in a time spike

Eugene Loh Tue, 30 Nov 2010 12:23:42 -0500

Ioannis Papadopoulos wrote:

Has anyone observed similar behaviour? Is it something that I'll have to deal with in my code, or does it indeed qualify as an issue to be looked into?

I would say this is NOT an issue that merits much attention. There are too many potential performance anomalies that you might be encountering, and they aren't worth "fixing" (or even understanding) unless they impact your application's performance in a meaningful way.


E.g., try timing "nothing".  Here is a sample test program:

#include <stdio.h>
#include <mpi.h>

#define N 1000000

int main(int argc, char **argv) {
 int i;
 static double t[N];  /* static: ~8 MB of samples may not fit on the stack on some systems */
 double tavg = 0, tmin = 1.e20, tmax = 0;

 MPI_Init(&argc,&argv);
 for ( i = 0; i < N; i++ ) {
   t[i] = MPI_Wtime();
   t[i] = MPI_Wtime() - t[i];
 }
 for ( i = 0; i < N; i++ ) {
   tavg += t[i];
   if ( tmin > t[i] ) tmin = t[i];
   if ( tmax < t[i] ) tmax = t[i];
 }
 tavg /= N;

 printf("avg %12.3lf\n", tavg * 1.e6);
 printf("min %12.3lf\n", tmin * 1.e6);
 printf("max %12.3lf\n", tmax * 1.e6);

 MPI_Finalize();
 return 0;
}

I find that the minimum is 0 (indicating non-infinitesimal granularity of the timer), the average is small (some overhead of the timer call), and the maximum is very large. Why? Because something will happen now and then. What it is doesn't matter unless your application's performance is suffering.
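To see how rare those "now and then" events are, one option is to count the samples that sit far above the average rather than looking only at the maximum. The following is a minimal sketch; count_outliers and its factor parameter are hypothetical names introduced here for illustration, not part of MPI or of the test program above.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (not an MPI call): count the samples in t[0..n-1]
 * that are larger than factor times the mean of all samples. */
size_t count_outliers(const double *t, size_t n, double factor)
{
    assert(t != NULL && n > 0 && factor > 0.0);

    /* First pass: mean of the samples. */
    double sum = 0.0;
    for (size_t i = 0; i < n; i++)
        sum += t[i];
    double limit = factor * (sum / (double)n);

    /* Second pass: count the samples above the threshold. */
    size_t count = 0;
    for (size_t i = 0; i < n; i++)
        if (t[i] > limit)
            count++;
    return count;
}
```

In the test program, something like count_outliers(t, N, 100.0) after the timing loop would report how many of the N measurements were more than 100 times the average. If that count is a tiny fraction of N, the large maximum is noise rather than a trend.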

You report that the overall time is about the same. That is, it takes just over a second to receive the message, which is expected if the sender delays a second before sending.

One of the things you could do is look at total time to receive the message and total time spent in MPI_Test. Then, vary TIMEOUT more smoothly (0.000001, 0.000002, 0.000005, 0.00001, 0.00002, 0.00005, 0.0001, 0.0002, 0.0005, 0.001, 0.002, 0.005, 0.01, 0.02). You may also have to run many times to see how reproducible the results are. As TIMEOUT increases, the total time to get the message will roughly increase, but not by much until TIMEOUT gets pretty large. The total time spent in MPI_Test should fall as TIMEOUT increases. So, the idea is that by increasing TIMEOUT, you decrease the responsiveness of the receiver while you make more CPU time available for other tasks. With any luck, there will be a broad range of TIMEOUT values that degrade responsiveness negligibly while freeing a meaningful amount of time up for other computational tasks.
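The trade-off can be illustrated with a toy model that involves no MPI at all. This is only a sketch under stated assumptions: every check costs a fixed test_cost seconds, the receiver does other work for timeout seconds between checks, and the message becomes visible at time arrival. The names poll_result and simulate_polling are hypothetical; a real measurement would time actual MPI_Test calls with MPI_Wtime.

```c
#include <assert.h>

/* Hypothetical result record for one simulated receive. */
typedef struct {
    long   polls;      /* number of simulated checks performed          */
    double recv_time;  /* time at which the receiver has the message    */
    double poll_time;  /* total time spent inside the simulated checks  */
} poll_result;

/* Toy model of a polling receiver (assumption: fixed cost per check).
 * arrival   - time at which the message becomes available
 * timeout   - time spent on other work between two checks
 * test_cost - time consumed by each check */
poll_result simulate_polling(double arrival, double timeout, double test_cost)
{
    assert(arrival >= 0.0 && timeout >= 0.0 && test_cost >= 0.0);
    assert(timeout + test_cost > 0.0);  /* otherwise time never advances */

    poll_result r = {0, 0.0, 0.0};
    double t = 0.0;
    for (;;) {
        /* A check succeeds only if the message was there when it started. */
        int done = (t >= arrival);
        t += test_cost;
        r.polls++;
        if (done)
            break;
        t += timeout;  /* other work until the next check */
    }
    r.recv_time = t;
    r.poll_time = (double)r.polls * test_cost;
    return r;
}
```

With arrival = 1.0 and test_cost = 0.125, raising timeout from 0 to 0.25 in this model cuts the number of checks from 9 to 4 (polling time from 1.125 to 0.5), while the time at which the message is noticed only moves from 1.125 to 1.25. Real numbers will differ, but that is the shape of curve to look for when sweeping TIMEOUT.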

The performance of MPI_Test() -- and of a particular MPI_Test() call -- is probably not very meaningful.

Note that your MPI_Irecv calls should strictly speaking have MPI_ANY_SOURCE rather than MPI_ANY_TAG.
