Hi,

> compiling with -m64 -xopenmp gives one or two hundred MB/s more,
> but the values can also vary by almost the same amount between
> different runs. So one probably has to run a series of 10 tests or more
> and compute the mean value to see a consistent difference.
>
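A minimal sketch of that kind of comparison, assuming the bandwidth figures from each run have already been collected into lists. The function names and the "difference larger than the combined spread" rule are only illustrative assumptions, a crude check rather than a proper significance test:

```python
from statistics import mean, stdev


def summarize(rates_mb_s):
    """Return (mean, sample standard deviation) of per-run bandwidth figures.

    Needs at least two runs, since the sample standard deviation is
    undefined for a single value.
    """
    return mean(rates_mb_s), stdev(rates_mb_s)


def clearly_different(runs_a, runs_b):
    """Crude heuristic (an assumption, not a statistical test):

    treat two build configurations as different only if their means are
    further apart than the sum of their run-to-run standard deviations.
    """
    mean_a, sd_a = summarize(runs_a)
    mean_b, sd_b = summarize(runs_b)
    return abs(mean_a - mean_b) > sd_a + sd_b
```

With only a handful of runs per configuration the standard deviation itself is noisy, which is one more reason to collect many runs before trusting a gain of a few hundred MB/s.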
Was this with OMP_NUM_THREADS set to the processor count? You tend to need quite a few iterations. When we do runs in my group, we run stream twenty times, average that result, then reboot, and get at least ten more iterations of the twenty-run average.

> Thinking about it, there's only one MMU on the Athlon64 even if I use
> both cores to throw data at it. And the CPU is in the region of four to
> six times faster than RAM. So using one core should be enough to drive
> RAM at top speed.
>

What size are you setting N to in the stream code? We pull this out of smbios info where it's available.

- Fintan

_______________________________________________
perf-discuss mailing list
perf-discuss@opensolaris.org