Aaron Sherman <[EMAIL PROTECTED]> wrote: > Good stuff! One thing leaps to mind when you mention the cache though... > keep in mind that blowing L2 cache (which we might be in no danger of > doing at all, but I'm just bringing it up) might be WORSE than you would > think on P4 and beyond because of hyperthreading.
Well, turn off ARENA_DOD_FLAGS and force a few DOD runs with one million PMCs in some array. Run valgrind [1] on that. Then turn on ARENA_DOD_FLAGS again and watch the difference :) I know that cache misses have worse impact on either faster CPUs or with HT. Valgrind docs state that a L2 miss is worth around 200 CPU cycles, where you can do something meaningful - e.g. avoid that miss. leo [1] $ cat vgcp valgrind --skin=cachegrind parrot "$@"