On Thu, Jul 16, 2015 at 12:15 AM, Peter Zijlstra <pet...@infradead.org> wrote:
> On Thu, Jul 16, 2015 at 08:02:03AM +0200, Stephane Eranian wrote:
>> Been running it for a couple of hours, so far so good. I will let it
>> run all night.
>
> Thanks!
>
Well, it died on NHM in the same function despite your patch. I need to
look at the exact warning.
So more work is needed. But I also saw the irq loop stuck message
before that.


>> > ---
>> >  arch/x86/kernel/cpu/perf_event_intel_ds.c | 29 +++++++++++++----------------
>> >  1 file changed, 13 insertions(+), 16 deletions(-)
>> >
>> > diff --git a/arch/x86/kernel/cpu/perf_event_intel_ds.c b/arch/x86/kernel/cpu/perf_event_intel_ds.c
>> > index 71fc40238843..68d0ced1d229 100644
>> > --- a/arch/x86/kernel/cpu/perf_event_intel_ds.c
>> > +++ b/arch/x86/kernel/cpu/perf_event_intel_ds.c
>> > @@ -1142,6 +1142,7 @@ static void intel_pmu_drain_pebs_nhm(struct pt_regs *iregs)
>> >
>> >         for (at = base; at < top; at += x86_pmu.pebs_record_size) {
>> >                 struct pebs_record_nhm *p = at;
>> > +               u64 pebs_status;
>> >
>> >                 /* PEBS v3 has accurate status bits */
>> >                 if (x86_pmu.intel_cap.pebs_format >= 3) {
>> > @@ -1152,12 +1153,14 @@ static void intel_pmu_drain_pebs_nhm(struct pt_regs *iregs)
>> >                         continue;
>> >                 }
>> >
>> > -               bit = find_first_bit((unsigned long *)&p->status,
>> > +               pebs_status = p->status & cpuc->pebs_enabled;
>> > +               pebs_status &= (1ULL << x86_pmu.max_pebs_events) - 1;
>> > +
>> > +               bit = find_first_bit((unsigned long *)&pebs_status,
>> >                                         x86_pmu.max_pebs_events);
>> >                 if (bit >= x86_pmu.max_pebs_events)
>> >                         continue;
>
> Maybe we should WARN in this case? A PEBS entry without any PEBS bits
> set in the status field would be 'weird', right?
>
> Maybe something like:
>
>                 if (WARN(bit >= x86_pmu.max_pebs_events,
>                          "PEBS record without PEBS event! status=%Lx pebs_enabled=%Lx active_mask=%Lx",
>                          p->status, cpuc->pebs_enabled, cpuc->active_mask))
>                         continue;
>
> If that triggers we at least get more info.
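
For reference, the masking done in the patch above can be tried outside
the kernel. What follows is only a user-space sketch under stated
assumptions: first_pebs_bit() and its max_events parameter are made-up
names standing in for the find_first_bit() call and
x86_pmu.max_pebs_events, and the plain loop replaces the kernel bitmap
helper. It is not the kernel code itself.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Hypothetical user-space helper mirroring the patched logic:
 * keep only status bits that belong to enabled PEBS counters and
 * that fall below max_events, then return the lowest set bit.
 * Returns max_events when no such bit is set, which is the case
 * the proposed WARN() is meant to report.
 */
static unsigned int first_pebs_bit(uint64_t status, uint64_t pebs_enabled,
                                   unsigned int max_events)
{
        uint64_t pebs_status;
        unsigned int bit;

        /* Shifting a 64-bit value by 64 or more is undefined in C. */
        assert(max_events > 0 && max_events < 64);

        pebs_status = status & pebs_enabled;
        pebs_status &= (1ULL << max_events) - 1;

        for (bit = 0; bit < max_events; bit++) {
                if (pebs_status & (1ULL << bit))
                        return bit;
        }

        return max_events;
}
```

With max_events = 4, a record whose status has only high bits set (for
example bit 32) returns 4, so it would be skipped rather than matched to
a counter.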
