Is the machine a single lgroup? With 1GB pages, the memory might be remote
for one of the CPUs and local for another.
Is it really TLB misses? Does cputrack show an increase in TLB misses?
Do you get more cache misses?
Does pmap indicate that the app got large pages?
Regards,
Darryl.
On 8/20/2010 9:43 PM
Hi,
My AMD Opteron supports 4KB, 2MB and 1GB page sizes. I observed a
performance improvement (reduced elapsed time) for some multi-threaded
applications when I used a 2MB page size for the heap. These applications need
around 650MB of heap (they read a huge file of around 650MB). However