On Tue, 19 Dec 2017 22:22:27 -0800, Jakub Kicinski wrote:
> > >I get this:
> >
> > Could you try to run it with kasan on?
>
> I didn't manage to reproduce it with KASAN on so far :(  Even enabling
> object debugging to get the second splat in my email (which is more
> useful) actually makes the crash go away, I only see the warning...

Ah, no object debug but KASAN on produces this:

[ 39.268209] BUG: KASAN: use-after-free in cpu_needs_another_gp+0x246/0x2b0
[ 39.275965] Read of size 8 at addr ffff8803aa64f138 by task swapper/13/0
[ 39.283524]
[ 39.285256] CPU: 13 PID: 0 Comm: swapper/13 Not tainted 4.15.0-rc3-perf-00955-g1d0b01347dd5-dirty #8
[ 39.295535] Hardware name: Dell Inc. PowerEdge R730/072T6D, BIOS 2.3.4 11/08/2016
[ 39.303969] Call Trace:
[ 39.306769] <IRQ>
[ 39.309088] dump_stack+0xa6/0x118
[ 39.312957] ? _atomic_dec_and_lock+0xe8/0xe8
[ 39.317895] ? cpu_needs_another_gp+0x246/0x2b0
[ 39.323030] print_address_description+0x6a/0x270
[ 39.328380] ? cpu_needs_another_gp+0x246/0x2b0
[ 39.333510] kasan_report+0x23f/0x350
[ 39.337672] cpu_needs_another_gp+0x246/0x2b0
...
[ 39.383026] rcu_process_callbacks+0x1a0/0x620
...
[ 39.426713] __do_softirq+0x17f/0x4de
...
[ 39.463841] irq_exit+0xe1/0xf0
[ 39.467437] smp_apic_timer_interrupt+0xd9/0x290
[ 39.472685] ? smp_call_function_single_interrupt+0x230/0x230
[ 39.479195] ? smp_reschedule_interrupt+0x240/0x240
[ 39.484736] apic_timer_interrupt+0x8c/0xa0
[ 39.489497] </IRQ>
[ 39.491929] RIP: 0010:cpuidle_enter_state+0x12a/0x510
[ 39.497660] RSP: 0018:ffff88086bf9fd08 EFLAGS: 00000246 ORIG_RAX: ffffffffffffff11
[ 39.506228] RAX: 0000000000000000 RBX: ffffe8ffffb060e0 RCX: ffffffff921329f5
[ 39.514291] RDX: dffffc0000000000 RSI: dffffc0000000000 RDI: ffff88086f3246e8
[ 39.522354] RBP: 1ffff1010d7f3fa6 R08: fffffbfff2742768 R09: fffffbfff2742768
[ 39.530418] R10: ffff88086bf9fcc8 R11: fffffbfff2742767 R12: 0000000924148b4b
[ 39.538480] R13: 0000000000000004 R14: 0000000000000004 R15: ffffffff9383eb80
[ 39.546545] ? sched_idle_set_state+0x25/0x30
[ 39.551502] ? cpuidle_enter_state+0x106/0x510
[ 39.556556] ? cpuidle_enter_s2idle+0x130/0x130
[ 39.561706] ? rcu_eqs_enter_common.constprop.62+0xd1/0x1e0
[ 39.568037] ? rcu_gp_init+0xf70/0xf70
[ 39.572331] ? sched_set_stop_task+0x160/0x160
[ 39.577384] do_idle+0x1af/0x200
[ 39.581076] cpu_startup_entry+0xd2/0xe0
[ 39.585545] ? cpu_in_idle+0x20/0x20
[ 39.589626] ? _raw_spin_trylock+0xe0/0xe0
[ 39.594292] ? memcpy+0x34/0x50
[ 39.597890] start_secondary+0x271/0x2b0
[ 39.602361] ? set_cpu_sibling_map+0x840/0x840
[ 39.607416] secondary_startup_64+0xa5/0xb0
[ 39.612180]
[ 39.613929] Allocated by task 1358:
[ 39.617914] __kmalloc_node+0x183/0x2c0
[ 39.622290] qdisc_alloc+0xbd/0x3f0
[ 39.626274] qdisc_create+0xd8/0x720
[ 39.630355] tc_modify_qdisc+0x657/0x910
[ 39.634826] rtnetlink_rcv_msg+0x37c/0x7e0
[ 39.639491] netlink_rcv_skb+0x122/0x230
[ 39.643960] netlink_unicast+0x2ae/0x360
[ 39.648443] netlink_sendmsg+0x5d5/0x620
[ 39.652915] sock_sendmsg+0x64/0x80
[ 39.656900] ___sys_sendmsg+0x4a8/0x500
[ 39.661272] __sys_sendmsg+0xa9/0x140
[ 39.665450] entry_SYSCALL_64_fastpath+0x1e/0x81
[ 39.670695]
[ 39.672441] Freed by task 1370:
[ 39.676052] kfree+0x8d/0x1c0
[ 39.679454] qdisc_graft+0x208/0x670
[ 39.683535] tc_get_qdisc+0x229/0x350
[ 39.687713] rtnetlink_rcv_msg+0x37c/0x7e0
[ 39.692411] netlink_rcv_skb+0x122/0x230
[ 39.696881] netlink_unicast+0x2ae/0x360
[ 39.701350] netlink_sendmsg+0x5d5/0x620
[ 39.705819] sock_sendmsg+0x64/0x80
[ 39.709801] ___sys_sendmsg+0x4a8/0x500
[ 39.714172] __sys_sendmsg+0xa9/0x140
[ 39.718351] entry_SYSCALL_64_fastpath+0x1e/0x81
[ 39.723597]
[ 39.725347] The buggy address belongs to the object at ffff8803aa64ef80
[ 39.725347] which belongs to the cache kmalloc-512 of size 512
[ 39.739453] The buggy address is located 440 bytes inside of
[ 39.739453] 512-byte region [ffff8803aa64ef80, ffff8803aa64f180)
[ 39.752684] The buggy address belongs to the page:
[ 39.758127] page:0000000042b3124b count:1 mapcount:0 mapping: (null) index:0x0 compound_mapcount: 0
[ 39.769222] flags: 0x2ffff0000008100(slab|head)
[ 39.774365] raw: 02ffff0000008100 0000000000000000 0000000000000000 0000000180190019
[ 39.783129] raw: dead000000000100 dead000000000200 ffff8803afc0ed80 0000000000000000
[ 39.791986] page dumped because: kasan: bad access detected
[ 39.798300]
[ 39.800063] Memory state around the buggy address:
[ 39.805503]  ffff8803aa64f000: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
[ 39.813684]  ffff8803aa64f080: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
[ 39.821866] >ffff8803aa64f100: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
[ 39.830045]                                         ^
[ 39.835778]  ffff8803aa64f180: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
[ 39.843958]  ffff8803aa64f200: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
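
The numbers in the report can be cross-checked from the addresses alone. Below is a minimal sketch in Python that uses only values copied from the splat; the variable names are illustrative, and it assumes generic KASAN's granularity of one shadow byte per 8 bytes of memory, which matches the 0x80-byte stride between the 16-byte rows of the memory state dump.

```python
# Values copied verbatim from the KASAN report above.
access_addr = 0xffff8803aa64f138   # "Read of size 8 at addr ..."
access_size = 8
object_start = 0xffff8803aa64ef80  # "belongs to the object at ..."
object_size = 512                  # cache kmalloc-512

# Offset of the faulting read inside the freed object.
offset = access_addr - object_start
object_end = object_start + object_size

# Assumption: generic KASAN, one shadow byte covers 8 bytes of memory.
SHADOW_GRANULE = 8
row_base = 0xffff8803aa64f100      # row marked with '>' in the dump
shadow_index = (access_addr - row_base) // SHADOW_GRANULE

print(f"offset into object : {offset} bytes (0x{offset:x})")
print(f"object region      : [{object_start:#x}, {object_end:#x})")
print(f"read fully inside  : {offset + access_size <= object_size}")
print(f"shadow byte in row : index {shadow_index} (0-based)")
```

The computed offset agrees with the "located 440 bytes inside of" line, and the read lies entirely within the 512-byte object, so this is an access to freed memory rather than an overrun into a neighbouring object.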