On Thu, Jun 11, 2015 at 03:13:57PM +0300, Alexander Shishkin wrote:
> @@ -1426,7 +1426,23 @@ perf_event_nmi_handler(unsigned int cmd, struct 
> pt_regs *regs)
>       u64 finish_clock;
>       int ret;
>  
> -     if (!atomic_read(&active_events))
> +     /*
> +      * Intel PT is a separate PMUs, it doesn't increment active_events
> +      * nor does it use resources associated with it, such as DS area.
> +      * However, it still uses the same PMI handler, so here we need
> +      * to make sure to call it *also* if any PT events are present.
> +      *
> +      * BTS events currently increment active_events implicitly through
> +      * x86_reserve_hardware(), where it acts as DS area reference
> +      * counter, so no need to check x86_lbr_exclusive_bts counter here;
> +      * otherwise we'd have to do the same for BTS.
> +      *
> +      * Theoretically, they could be made into separate PMI handlers, but
> +      * that can create additional challenges as PT uses the same PMI status
> +      * register as x86_pmu.
> +      */
> +     if (!atomic_read(&active_events) &&
> +         !atomic_read(&x86_pmu.lbr_exclusive[x86_lbr_exclusive_pt]))
>               return NMI_DONE;
>  

Urgh, sad. That's two cache misses instead of one. Cant we somehow keep
that a single value to read?
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to