Public bug reported: After a recent upgrade on my clients to Ubuntu 13.04 latest and Ubuntu 12.04.03 on the server, NFS4 clients began locking up with the errors below. Processes accessing NFS would hang up and lock, and could not be killed or adjusted. NFSv3 on the same server and client has been working fine.
NFS4 was configured with clean defaults, no special options set. No log entries corresponding with the problem were seen on the nfs server. Oct 22 09:22:57 edmonton kernel: [ 324.099094] NFS: nfs4_reclaim_open_state: Lock reclaim failed! Oct 22 09:27:34 edmonton kernel: [ 600.213861] INFO: task Chrome_CacheThr:3411 blocked for more than 120 seconds. Oct 22 09:27:34 edmonton kernel: [ 600.213869] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. Oct 22 09:27:34 edmonton kernel: [ 600.213872] Chrome_CacheThr D ffff88040fc33ec0 0 3411 1 0x00000000 Oct 22 09:27:34 edmonton kernel: [ 600.213880] ffff8803fce77aa8 0000000000000086 ffff880407749740 ffff8803fce77fd8 Oct 22 09:27:34 edmonton kernel: [ 600.213889] ffff8803fce77fd8 ffff8803fce77fd8 ffff880409d51740 ffff880407749740 Oct 22 09:27:34 edmonton kernel: [ 600.213895] ffff880407749740 ffff88040fc34778 0000000000000002 ffffffff8112f420 Oct 22 09:27:34 edmonton kernel: [ 600.213903] Call Trace: Oct 22 09:27:34 edmonton kernel: [ 600.213918] [<ffffffff8112f420>] ? __lock_page+0x70/0x70 Oct 22 09:27:34 edmonton kernel: [ 600.213931] [<ffffffff816cc4c9>] schedule+0x29/0x70 Oct 22 09:27:34 edmonton kernel: [ 600.213937] [<ffffffff816cc59f>] io_schedule+0x8f/0xd0 Oct 22 09:27:34 edmonton kernel: [ 600.213941] [<ffffffff8112f42e>] sleep_on_page+0xe/0x20 Oct 22 09:27:34 edmonton kernel: [ 600.213948] [<ffffffff816caabb>] __wait_on_bit_lock+0x5b/0xc0 Oct 22 09:27:34 edmonton kernel: [ 600.213953] [<ffffffff8112f417>] __lock_page+0x67/0x70 Oct 22 09:27:34 edmonton kernel: [ 600.213961] [<ffffffff8107df40>] ? autoremove_wake_function+0x40/0x40 Oct 22 09:27:34 edmonton kernel: [ 600.213965] [<ffffffff8112ffae>] find_lock_page+0x4e/0x80 Oct 22 09:27:34 edmonton kernel: [ 600.213971] [<ffffffff811303ac>] grab_cache_page_write_begin+0x6c/0xf0 Oct 22 09:27:34 edmonton kernel: [ 600.213997] [<ffffffffa01f6d78>] nfs_write_begin+0x78/0x230 [nfs] Oct 22 09:27:34 edmonton kernel: [ 600.214003] [<ffffffff8112f1e3>] generic_file_buffered_write+0x113/0x270 Oct 22 09:27:34 edmonton kernel: [ 600.214010] [<ffffffff81130ea4>] __generic_file_aio_write+0x1b4/0x3d0 Oct 22 09:27:34 edmonton kernel: [ 600.214018] [<ffffffff81196ac4>] ? __sb_start_write+0x54/0x110 Oct 22 09:27:34 edmonton kernel: [ 600.214023] [<ffffffff8113113f>] generic_file_aio_write+0x7f/0x100 Oct 22 09:27:34 edmonton kernel: [ 600.214035] [<ffffffffa01f7391>] nfs_file_write+0xc1/0x1e0 [nfs] Oct 22 09:27:34 edmonton kernel: [ 600.214041] [<ffffffff81194337>] do_sync_write+0xa7/0xe0 Oct 22 09:27:34 edmonton kernel: [ 600.214046] [<ffffffff81194a1c>] vfs_write+0xac/0x180 Oct 22 09:27:34 edmonton kernel: [ 600.214051] [<ffffffff81194efa>] sys_pwrite64+0x9a/0xb0 Oct 22 09:27:34 edmonton kernel: [ 600.214058] [<ffffffff816d5c5d>] system_call_fastpath+0x1a/0x1f Oct 22 09:29:27 edmonton avahi-daemon[1613]: Invalid response packet from host 128.111.246.200. Oct 22 09:29:27 edmonton avahi-daemon[1613]: Invalid response packet from host fe80::3aea:a7ff:fe6e:e945. Oct 22 09:29:34 edmonton kernel: [ 720.042124] INFO: task Chrome_CacheThr:3411 blocked for more than 120 seconds. Oct 22 09:29:34 edmonton kernel: [ 720.042131] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. Oct 22 09:29:34 edmonton kernel: [ 720.042134] Chrome_CacheThr D ffff88040fc33ec0 0 3411 1 0x00000000 Oct 22 09:29:34 edmonton kernel: [ 720.042142] ffff8803fce77aa8 0000000000000086 ffff880407749740 ffff8803fce77fd8 Oct 22 09:29:34 edmonton kernel: [ 720.042151] ffff8803fce77fd8 ffff8803fce77fd8 ffff880409d51740 ffff880407749740 Oct 22 09:29:34 edmonton kernel: [ 720.042157] ffff880407749740 ffff88040fc34778 0000000000000002 ffffffff8112f420 Oct 22 09:29:34 edmonton kernel: [ 720.042165] Call Trace: Oct 22 09:29:34 edmonton kernel: [ 720.042181] [<ffffffff8112f420>] ? __lock_page+0x70/0x70 Oct 22 09:29:34 edmonton kernel: [ 720.042193] [<ffffffff816cc4c9>] schedule+0x29/0x70 Oct 22 09:29:34 edmonton kernel: [ 720.042199] [<ffffffff816cc59f>] io_schedule+0x8f/0xd0 Oct 22 09:29:34 edmonton kernel: [ 720.042204] [<ffffffff8112f42e>] sleep_on_page+0xe/0x20 Oct 22 09:29:34 edmonton kernel: [ 720.042210] [<ffffffff816caabb>] __wait_on_bit_lock+0x5b/0xc0 Oct 22 09:29:34 edmonton kernel: [ 720.042215] [<ffffffff8112f417>] __lock_page+0x67/0x70 Oct 22 09:29:34 edmonton kernel: [ 720.042223] [<ffffffff8107df40>] ? autoremove_wake_function+0x40/0x40 Oct 22 09:29:34 edmonton kernel: [ 720.042228] [<ffffffff8112ffae>] find_lock_page+0x4e/0x80 Oct 22 09:29:34 edmonton kernel: [ 720.042233] [<ffffffff811303ac>] grab_cache_page_write_begin+0x6c/0xf0 Oct 22 09:29:34 edmonton kernel: [ 720.042259] [<ffffffffa01f6d78>] nfs_write_begin+0x78/0x230 [nfs] Oct 22 09:29:34 edmonton kernel: [ 720.042264] [<ffffffff8112f1e3>] generic_file_buffered_write+0x113/0x270 Oct 22 09:29:34 edmonton kernel: [ 720.042271] [<ffffffff81130ea4>] __generic_file_aio_write+0x1b4/0x3d0 Oct 22 09:29:34 edmonton kernel: [ 720.042278] [<ffffffff81196ac4>] ? __sb_start_write+0x54/0x110 Oct 22 09:29:34 edmonton kernel: [ 720.042283] [<ffffffff8113113f>] generic_file_aio_write+0x7f/0x100 Oct 22 09:29:34 edmonton kernel: [ 720.042295] [<ffffffffa01f7391>] nfs_file_write+0xc1/0x1e0 [nfs] Oct 22 09:29:34 edmonton kernel: [ 720.042301] [<ffffffff81194337>] do_sync_write+0xa7/0xe0 Oct 22 09:29:34 edmonton kernel: [ 720.042307] [<ffffffff81194a1c>] vfs_write+0xac/0x180 Oct 22 09:29:34 edmonton kernel: [ 720.042312] [<ffffffff81194efa>] sys_pwrite64+0x9a/0xb0 Oct 22 09:29:34 edmonton kernel: [ 720.042318] [<ffffffff816d5c5d>] system_call_fastpath+0x1a/0x1f Oct 22 09:29:34 edmonton kernel: [ 720.042323] INFO: task Chrome_SyncThre:3462 blocked for more than 120 seconds. Oct 22 09:29:34 edmonton kernel: [ 720.042325] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. Oct 22 09:29:34 edmonton kernel: [ 720.042327] Chrome_SyncThre D ffff88040fc13ec0 0 3462 1 0x00000000 Oct 22 09:29:34 edmonton kernel: [ 720.042332] ffff8803f8d01d20 0000000000000086 ffff8803fced9740 ffff8803f8d01fd8 Oct 22 09:29:34 edmonton kernel: [ 720.042339] ffff8803f8d01fd8 ffff8803f8d01fd8 ffff880407460000 ffff8803fced9740 Oct 22 09:29:34 edmonton kernel: [ 720.042345] ffff8803fced9740 ffff88040fc14778 0000000000000002 ffffffff8112f420 Oct 22 09:29:34 edmonton kernel: [ 720.042352] Call Trace: Oct 22 09:29:34 edmonton kernel: [ 720.042357] [<ffffffff8112f420>] ? __lock_page+0x70/0x70 Oct 22 09:29:34 edmonton kernel: [ 720.042362] [<ffffffff816cc4c9>] schedule+0x29/0x70 Oct 22 09:29:34 edmonton kernel: [ 720.042368] [<ffffffff816cc59f>] io_schedule+0x8f/0xd0 Oct 22 09:29:34 edmonton kernel: [ 720.042373] [<ffffffff8112f42e>] sleep_on_page+0xe/0x20 Oct 22 09:29:34 edmonton kernel: [ 720.042378] [<ffffffff816cac10>] __wait_on_bit+0x60/0x90 Oct 22 09:29:34 edmonton kernel: [ 720.042383] [<ffffffff8112f560>] wait_on_page_bit+0x80/0x90 Oct 22 09:29:34 edmonton kernel: [ 720.042388] [<ffffffff8107df40>] ? autoremove_wake_function+0x40/0x40 Oct 22 09:29:34 edmonton kernel: [ 720.042392] [<ffffffff8112fa61>] filemap_fdatawait_range+0x101/0x190 Oct 22 09:29:34 edmonton kernel: [ 720.042402] [<ffffffff8113aa4e>] ? do_writepages+0x1e/0x40 Oct 22 09:29:34 edmonton kernel: [ 720.042407] [<ffffffff81130ad1>] ? __filemap_fdatawrite_range+0x51/0x60 Oct 22 09:29:34 edmonton kernel: [ 720.042412] [<ffffffff81130b45>] filemap_write_and_wait_range+0x65/0x70 Oct 22 09:29:34 edmonton kernel: [ 720.042432] [<ffffffffa040bd77>] nfs4_file_fsync+0x67/0xb0 [nfsv4] Oct 22 09:29:34 edmonton kernel: [ 720.042440] [<ffffffff811c42cd>] do_fsync+0x5d/0x90 Oct 22 09:29:34 edmonton kernel: [ 720.042445] [<ffffffff81194cdb>] ? sys_read+0x6b/0xa0 Oct 22 09:29:34 edmonton kernel: [ 720.042450] [<ffffffff811c4553>] sys_fdatasync+0x13/0x20 Oct 22 09:29:34 edmonton kernel: [ 720.042455] [<ffffffff816d5c5d>] system_call_fastpath+0x1a/0x1f ** Affects: nfs-utils (Ubuntu) Importance: Undecided Status: New ** Attachment added: "apport output" https://bugs.launchpad.net/bugs/1243429/+attachment/3887835/+files/nfs-utils-apport.txt -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1243429 Title: NFS Lockups with 13.04 clients to 12.04.03 server To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/nfs-utils/+bug/1243429/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs