Mathieu,

>       cycles_per_iter = 0.0;
>       for (i=0; i<NR_TESTS; i++) {
>               time1 = get_cycles();
>               for (j = 0; j < NR_ITER; j++) {
>                       testval = &array[random() % ARRAY_SIZE];
>               }
>               time2 = get_cycles();
>               cycles_per_iter += (time2 - time1)/(double)NR_ITER;
>       }
>       cycles_per_iter /= (double)NR_TESTS;
>       printf("Just getting the pointer, doing noting with it, cycles
per
> iteration (mean) : %g\n", cycles_per_iter);
> 

Some comments on the code:

1. random() is counted in cycle_per_iter, which can skew the results.
You could pre-compute the random addresses and store them in an array.
Then, during the actual timing, walk the array:

index = 0;
for (i = 0; i < ARRAY_SIZE; i++)
      index = *(int *)(array + index * CACHE_LINE_SIZE);

2. You may want to flush the cache before the timing starts.

3. You want to access memory at the cache-line granularity to avoid
addresses falling into the same line (and thus unwanted hits).

If you do these, I expect you'll get a higher memory latency.

  tong
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to