On Sun, Feb 14, 2016 at 10:05:22PM -0800, Paul E. McKenney wrote: > On Sun, Feb 14, 2016 at 09:50:18AM +0300, Denis Kirjanov wrote: > > Tracepoints use RCU for protection and they must not be called on > > offline CPUS. So make this tracepoint conditional. > > Good catch! Queued for review and testing.
And I dequeued this in favor of Steven's more recent patch. But there is one other hitch... It is currently not legal to invoke call_rcu() from an offline CPU. You can get away with it during a short window towards the end of the offline process, but shortly after the outgoing CPU hits the idle loop, call_rcu() will splat and leak the callback. So what exactly is the purpose of invoking call_rcu() from an offline CPU? (Yes, I could probably make it work, but there needs to be a good reason.) Thanx, Paul > > NFO: suspicious RCU usage. ] > > [ 413.344670] 4.4.0-00006-g0fe53e8-dirty #33 Tainted: G S > > [ 413.344672] ------------------------------- > > [ 413.344673] include/trace/events/rcu.h:457 suspicious > > rcu_dereference_check() usage! > > [ 413.344674] > > other info that might help us debug this: > > > > [ 413.344676] > > RCU used illegally from offline CPU! > > rcu_scheduler_active = 1, debug_locks = 1 > > [ 413.344678] no locks held by swapper/4/0. > > [ 413.344679] > > stack backtrace: > > [ 413.344682] CPU: 4 PID: 0 Comm: swapper/4 Tainted: G S > > 4.4.0-00006-g0fe53e8-dirty #33 > > [ 413.344683] Call Trace: > > [ 413.344692] [c0000005b76b7800] [c0000000008bb080] .dump_stack+0x98/0xd4 > > (unreliable) > > [ 413.344698] [c0000005b76b7880] [c00000000010c8b8] > > .lockdep_rcu_suspicious+0x108/0x170 > > [ 413.344703] [c0000005b76b7910] [c00000000013b9e4] > > .__call_rcu.constprop.60+0x264/0x600 > > [ 413.344708] [c0000005b76b79e0] [c0000000002bceec] .put_object+0x5c/0x80 > > [ 413.344712] [c0000005b76b7a60] [c00000000029a368] > > .kmem_cache_free+0x298/0x450 > > [ 413.344716] [c0000005b76b7b00] [c000000000093494] .__mmdrop+0x54/0x150 > > [ 413.344720] [c0000005b76b7b90] [c0000000000e4010] > > .idle_task_exit+0x130/0x140 > > [ 413.344725] [c0000005b76b7c20] [c000000000075804] > > .pseries_mach_cpu_die+0x64/0x310 > > [ 413.344730] [c0000005b76b7cd0] [c000000000043e7c] .cpu_die+0x3c/0x60 > > [ 413.344734] [c0000005b76b7d40] [c0000000000188d8] > > .arch_cpu_idle_dead+0x28/0x40 > > [ 413.344738] [c0000005b76b7db0] [c000000000101e8c] > > .cpu_startup_entry+0x50c/0x560 > > [ 413.344741] [c0000005b76b7ed0] [c000000000043bd8] > > .start_secondary+0x328/0x360 > > [ 413.344746] [c0000005b76b7f90] [c000000000008a6c] > > start_secondary_prolog+0x10/0x14 > > > > Signed-off-by: Denis Kirjanov <k...@linux-powerpc.org> > > --- > > > > v2: Fix the build error that was made > > while sending the pacthes from another machine > > > > include/trace/events/rcu.h | 4 +++- > > 1 file changed, 3 insertions(+), 1 deletion(-) > > > > diff --git a/include/trace/events/rcu.h b/include/trace/events/rcu.h > > index ef72c4a..793d306b 100644 > > --- a/include/trace/events/rcu.h > > +++ b/include/trace/events/rcu.h > > @@ -428,13 +428,15 @@ TRACE_EVENT(rcu_prep_idle, > > * number of lazy callbacks queued, and the fourth element is the > > * total number of callbacks queued. > > */ > > -TRACE_EVENT(rcu_callback, > > +TRACE_EVENT_CONDITION(rcu_callback, > > > > TP_PROTO(const char *rcuname, struct rcu_head *rhp, long qlen_lazy, > > long qlen), > > > > TP_ARGS(rcuname, rhp, qlen_lazy, qlen), > > > > + TP_CONDITION(cpu_online(raw_smp_processor_id())), > > + > > TP_STRUCT__entry( > > __field(const char *, rcuname) > > __field(void *, rhp) > > -- > > 2.4.0 > >