Re: [PATCH 08/19] perf c2c: Shared data analyser

Don Zickus Mon, 03 Mar 2014 12:34:29 -0800

On Mon, Mar 03, 2014 at 07:58:19PM +0100, Andi Kleen wrote:
> On Mon, Mar 03, 2014 at 07:41:17PM +0100, Peter Zijlstra wrote:
> > On Mon, Mar 03, 2014 at 01:07:00PM -0500, Joe Mario wrote:
> > > If you only sample on the HITMs then you don't get the store misses.
> > > That means you'll not be able to detect who is simultaneously tugging
> > > on the same cache lines.  That gives up much of the value of "perf
> > > c2c".
> > 
> > As long as you know which lines are hurting bringing in (loads) you can
> > often figure out who is doing the stores on them.
> 
> Yes, especially since every store is a load too (unless you're talking
> WC)


Thoughts on how to determine which load is a potential store?  I agree
every store needs to load the cacheline, but I wasn't sure if there was an
approach that could be applied to determine anything useful.

Cheers,
Don

> 
> The method c2c uses is more exact, but keep in mind it's a sampling
> heuristic in any cases, with some potential bias. load-latency tags
> the loads randomly and there's no guarantee that tagging is fully
> uniform. Also you only see a subset in any case.
> 
> > 
> > > As we developed this, we ended up settling on Ivy Bridge to get the
> > > behavior we wanted.
> > 
> > Wouldn't SNB also work?
> 
> Yes.
> 
> Haswell is best however because it can report addresses on far more
> events.
> 
> -Andi
> 
> -- 
> [email protected] -- Speaking for myself only.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Re: [PATCH 08/19] perf c2c: Shared data analyser

Reply via email to