On Mon, Oct 14, 2013 at 8:46 AM, Dave Jones <da...@redhat.com> wrote: > Machine is wedged and I can't get to it until tomorrow, but this is what was > on serial console. > kernel running was from some time last Friday, I can get exact info tomorrow, > though > I don't think there's anything epoll related recently that could explain this.
It looks like it is the access to "lock->key" that takes a page fault. The pointer looks good (%r13=ffff8801654cec98), so I'm pretty sure this is due to DEBUG_PAGEALLOC and a free'd page. So it looks like ep_unregister_pollwait() calls remove_wait_queue() on a wait-queue head that has already been free'd. I have this dim memory of us having fought this before. But maybe I'm just remembering some of the old signalfd-vs-epoll races. Oleg, does this trigger any memory for you? Commit 971316f0503a ("epoll: ep_unregister_pollwait() can use the freed pwq->whead") just makes me go "Hmm, this is *exactly* that that commit is talking about.." Linus --- > Oops: 0000 [#1] PREEMPT SMP DEBUG_PAGEALLOC > CPU: 3 PID: 449 Comm: trinity-main Not tainted 3.12.0-rc4+ #98 > task: ffff88023e239560 ti: ffff880083082000 task.ti: ffff880083082000 > RIP: 0010:[<ffffffff810c9f98>] [<ffffffff810c9f98>] > __lock_acquire+0x58/0x1be0 > Call Trace: > [<ffffffff810cc2d3>] lock_acquire+0x93/0x200 > [<ffffffff81732c3b>] _raw_spin_lock_irqsave+0x4b/0x90 > [<ffffffff810848d9>] remove_wait_queue+0x19/0x40 > [<ffffffff812120eb>] ep_unregister_pollwait.isra.14+0x5b/0x1e0 > [<ffffffff81212786>] ep_remove+0x26/0x140 > [<ffffffff81213391>] eventpoll_release_file+0x71/0xa0 > [<ffffffff811c4faa>] __fput+0x2aa/0x2d0 > [<ffffffff811c501e>] ____fput+0xe/0x10 > [<ffffffff8107d67c>] task_work_run+0xac/0xe0 > [<ffffffff81056bd7>] do_exit+0x2c7/0xcc0 > [<ffffffff810589cc>] do_group_exit+0x4c/0xc0 > [<ffffffff81058a54>] SyS_exit_group+0x14/0x20 > [<ffffffff8173bf64>] tracesys+0xdd/0xe2 > Code: 85 c0 8b 05 4b d6 bc 00 45 0f 45 e0 85 c0 0f 84 07 01 00 00 8b 05 31 af > 00 01 49 89 fd 41 89 f7 41 89 d3 85 c0 0f 84 08 01 00 00 <49> 8b 45 00 ba 01 > 00 00 00 48 3d 60 6a 13 82 44 0f 44 e2 41 83 > RIP [<ffffffff810c9f98>] __lock_acquire+0x58/0x1be0 > RSP <ffff880083083c18> > CR2: ffff8801654cec98 > ---[ end trace 044e98c2d3aab216 ]--- > -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/