Update: with the mainline kernel, I observe a slightly different pattern. When several heavy I/O operations run in parallel (e.g. rsyncing a large ISO image into one container, performing an HTTP upload into another, and running "yum update" in all containers), the large uploads slow to a crawl and eventually stall.
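For anyone trying to reproduce this, the parallel workload described above could be driven by a small script along the following lines. This is only a sketch, not the procedure actually used for this report: the helper names (run_timed, run_parallel), the timeout, and the example commands in the comment (container names, paths, URL) are hypothetical placeholders.

```python
import subprocess
import time
from concurrent.futures import ThreadPoolExecutor


def run_timed(cmd, timeout=None):
    """Run one command and return (cmd, exit_code, seconds).

    exit_code is None if the command did not finish within `timeout`.
    """
    start = time.monotonic()
    try:
        code = subprocess.run(cmd, timeout=timeout).returncode
    except subprocess.TimeoutExpired:
        code = None  # treated as "stalled" by the caller
    return cmd, code, time.monotonic() - start


def run_parallel(cmds, timeout=None):
    """Start all commands at the same time; results come back in input order."""
    with ThreadPoolExecutor(max_workers=max(1, len(cmds))) as pool:
        return list(pool.map(lambda c: run_timed(c, timeout), cmds))


# Hypothetical workload (names and paths are assumptions, adjust to your setup):
# results = run_parallel([
#     ["rsync", "-a", "large.iso", "/var/lib/lxc/c1/rootfs/tmp/"],
#     ["curl", "-T", "large.iso", "http://c2.example/upload"],
#     ["lxc-attach", "-n", "c3", "--", "yum", "-y", "update"],
# ], timeout=3600)
# for cmd, code, secs in results:
#     print(cmd[0], "stalled" if code is None else code, round(secs, 1))
```

Note that a process blocked in uninterruptible sleep (state "D", as in the traces that follow) cannot be killed until it wakes up, so the timeout may itself hang in that case; the hung-task messages in dmesg remain the more reliable indicator.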
"dmesg" reveals some different btrfs related issues:

[ 6838.005920] INFO: task kworker/u16:0:5815 blocked for more than 120 seconds.
[ 6838.005924] Not tainted 3.19.0-031900rc6-generic #201501261152
[ 6838.005925] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 6838.005926] kworker/u16:0 D ffff88024422bb18 0 5815 2 0x00000000
[ 6838.005953] Workqueue: events_unbound btrfs_async_reclaim_metadata_space [btrfs]
[ 6838.005954] ffff88024422bb18 ffff88024422bad8 ffff88024422bfd8 00000000000141c0
[ 6838.005956] ffff88030c1b0700 ffff88021a1e13a0 ffff8802c78a75c0 ffff88024422bb08
[ 6838.005958] ffff88024422bc88 7fffffffffffffff 7fffffffffffffff ffff8802c78a75c0
[ 6838.005959] Call Trace:
[ 6838.005965] [<ffffffff817cd6b9>] schedule+0x29/0x70
[ 6838.005968] [<ffffffff817d0445>] schedule_timeout+0x1b5/0x210
[ 6838.005972] [<ffffffff8108e01a>] ? __queue_delayed_work+0xaa/0x1a0
[ 6838.005974] [<ffffffff8108e5db>] ? try_to_grab_pending+0x4b/0x80
[ 6838.005976] [<ffffffff817cebc7>] wait_for_completion+0xa7/0x160
[ 6838.005979] [<ffffffff810a3fa0>] ? try_to_wake_up+0x2a0/0x2a0
[ 6838.005983] [<ffffffff8121d6c6>] writeback_inodes_sb_nr+0x86/0xb0
[ 6838.005997] [<ffffffffc0630b9d>] shrink_delalloc+0x10d/0x300 [btrfs]
[ 6838.006011] [<ffffffffc0630e68>] flush_space+0xd8/0x150 [btrfs]
[ 6838.006022] [<ffffffffc063175b>] btrfs_async_reclaim_metadata_space+0x14b/0x1d0 [btrfs]
[ 6838.006024] [<ffffffff8108f6dd>] process_one_work+0x14d/0x460
[ 6838.006026] [<ffffffff810900bb>] worker_thread+0x11b/0x3f0
[ 6838.006029] [<ffffffff8108ffa0>] ? create_worker+0x1e0/0x1e0
[ 6838.006031] [<ffffffff81095cc9>] kthread+0xc9/0xe0
[ 6838.006032] [<ffffffff81095c00>] ? flush_kthread_worker+0x90/0x90
[ 6838.006035] [<ffffffff817d17fc>] ret_from_fork+0x7c/0xb0
[ 6838.006037] [<ffffffff81095c00>] ? flush_kthread_worker+0x90/0x90
[ 6957.962660] INFO: task kworker/u16:0:5815 blocked for more than 120 seconds.
[ 6957.962667] Not tainted 3.19.0-031900rc6-generic #201501261152
[ 6957.962668] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 6957.962671] kworker/u16:0 D ffff88024422bb18 0 5815 2 0x00000000
[ 6957.962706] Workqueue: events_unbound btrfs_async_reclaim_metadata_space [btrfs]
[ 6957.962709] ffff88024422bb18 ffff88024422bad8 ffff88024422bfd8 00000000000141c0
[ 6957.962713] ffff88030c1b0700 ffff88021a1e13a0 ffff8802c78a75c0 ffff88024422bb08
[ 6957.962716] ffff88024422bc88 7fffffffffffffff 7fffffffffffffff ffff8802c78a75c0
[ 6957.962720] Call Trace:
[ 6957.962741] [<ffffffff817cd6b9>] schedule+0x29/0x70
[ 6957.962746] [<ffffffff817d0445>] schedule_timeout+0x1b5/0x210
[ 6957.962752] [<ffffffff8108e01a>] ? __queue_delayed_work+0xaa/0x1a0
[ 6957.962756] [<ffffffff8108e5db>] ? try_to_grab_pending+0x4b/0x80
[ 6957.962760] [<ffffffff817cebc7>] wait_for_completion+0xa7/0x160
[ 6957.962765] [<ffffffff810a3fa0>] ? try_to_wake_up+0x2a0/0x2a0
[ 6957.962771] [<ffffffff8121d6c6>] writeback_inodes_sb_nr+0x86/0xb0
[ 6957.962787] [<ffffffffc0630b9d>] shrink_delalloc+0x10d/0x300 [btrfs]
[ 6957.962803] [<ffffffffc0630e68>] flush_space+0xd8/0x150 [btrfs]
[ 6957.962817] [<ffffffffc063175b>] btrfs_async_reclaim_metadata_space+0x14b/0x1d0 [btrfs]
[ 6957.962822] [<ffffffff8108f6dd>] process_one_work+0x14d/0x460
[ 6957.962826] [<ffffffff810900bb>] worker_thread+0x11b/0x3f0
[ 6957.962830] [<ffffffff8108ffa0>] ? create_worker+0x1e0/0x1e0
[ 6957.962834] [<ffffffff81095cc9>] kthread+0xc9/0xe0
[ 6957.962838] [<ffffffff81095c00>] ? flush_kthread_worker+0x90/0x90
[ 6957.962842] [<ffffffff817d17fc>] ret_from_fork+0x7c/0xb0
[ 6957.962846] [<ffffffff81095c00>] ? flush_kthread_worker+0x90/0x90
[ 6962.761961] systemd-hostnamed[15586]: Warning: nss-myhostname is not installed. Changing the local hostname might make it unresolveable. Please install nss-myhostname!
[ 7437.789596] INFO: task yum:14547 blocked for more than 120 seconds.
[ 7437.789600] Not tainted 3.19.0-031900rc6-generic #201501261152
[ 7437.789601] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 7437.789602] yum D ffff880286777868 0 14547 14546 0x00000000
[ 7437.789605] ffff880286777868 0000000200000001 ffff880286777fd8 00000000000141c0
[ 7437.789607] ffff88002e07db00 ffffffff81c1c500 ffff8801f8892740 ffff880286777858
[ 7437.789608] ffff8802867779d8 7fffffffffffffff 7fffffffffffffff ffff8801f8892740
[ 7437.789610] Call Trace:
[ 7437.789616] [<ffffffff817cd6b9>] schedule+0x29/0x70
[ 7437.789619] [<ffffffff817d0445>] schedule_timeout+0x1b5/0x210
[ 7437.789623] [<ffffffff8108e01a>] ? __queue_delayed_work+0xaa/0x1a0
[ 7437.789625] [<ffffffff8108e5db>] ? try_to_grab_pending+0x4b/0x80
[ 7437.789628] [<ffffffff817cebc7>] wait_for_completion+0xa7/0x160
[ 7437.789634] [<ffffffff810a3fa0>] ? try_to_wake_up+0x2a0/0x2a0
[ 7437.789638] [<ffffffff8121d6c6>] writeback_inodes_sb_nr+0x86/0xb0
[ 7437.789674] [<ffffffffc0630b9d>] shrink_delalloc+0x10d/0x300 [btrfs]
[ 7437.789692] [<ffffffffc0628cbd>] ? get_alloc_profile+0x5d/0x90 [btrfs]
[ 7437.789707] [<ffffffffc06304c0>] ? btrfs_get_alloc_profile+0x30/0x40 [btrfs]
[ 7437.789719] [<ffffffffc0630e68>] flush_space+0xd8/0x150 [btrfs]
[ 7437.789731] [<ffffffffc06310b9>] reserve_metadata_bytes+0x1d9/0x590 [btrfs]
[ 7437.789743] [<ffffffffc0624659>] ? btrfs_search_slot+0x3a9/0x870 [btrfs]
[ 7437.789760] [<ffffffffc0664d50>] ? set_state_bits+0x40/0x80 [btrfs]
[ 7437.789773] [<ffffffffc06320f5>] btrfs_block_rsv_add+0x35/0x60 [btrfs]
[ 7437.789788] [<ffffffffc065fff2>] ? try_merge_map+0x32/0x150 [btrfs]
[ 7437.789801] [<ffffffffc0649e15>] start_transaction.part.35+0x185/0x540 [btrfs]
[ 7437.789813] [<ffffffffc064a1f9>] start_transaction+0x29/0x30 [btrfs]
[ 7437.789824] [<ffffffffc064a53b>] btrfs_start_transaction+0x1b/0x20 [btrfs]
[ 7437.789837] [<ffffffffc065344a>] maybe_insert_hole+0x8a/0x1b0 [btrfs]
[ 7437.789847] [<ffffffffc0655452>] btrfs_cont_expand+0x1c2/0x340 [btrfs]
[ 7437.789857] [<ffffffffc065f310>] btrfs_file_write_iter+0x2e0/0x360 [btrfs]
[ 7437.789859] [<ffffffff811f408b>] new_sync_write+0x7b/0xb0
[ 7437.789861] [<ffffffff811f4f07>] vfs_write+0xc7/0x1f0
[ 7437.789862] [<ffffffff811f52af>] SyS_write+0x4f/0xb0
[ 7437.789865] [<ffffffff817cd6b9>] ? schedule+0x29/0x70
[ 7437.789867] [<ffffffff817d18ad>] system_call_fastpath+0x16/0x1b
[ 7677.703046] INFO: task kworker/u16:10:16126 blocked for more than 120 seconds.
[ 7677.703051] Not tainted 3.19.0-031900rc6-generic #201501261152
[ 7677.703053] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 7677.703055] kworker/u16:10 D ffff88010fdabb18 0 16126 2 0x00000000
[ 7677.703086] Workqueue: events_unbound btrfs_async_reclaim_metadata_space [btrfs]
[ 7677.703088] ffff88010fdabb18 01000000000000a1 ffff88010fdabfd8 00000000000141c0
[ 7677.703091] ffff88002e07f700 ffff8802443075c0 ffff880209aff5c0 ffff88010fdabb08
[ 7677.703094] ffff88010fdabc88 7fffffffffffffff 7fffffffffffffff ffff880209aff5c0
[ 7677.703097] Call Trace:
[ 7677.703103] [<ffffffff817cd6b9>] schedule+0x29/0x70
[ 7677.703106] [<ffffffff817d0445>] schedule_timeout+0x1b5/0x210
[ 7677.703111] [<ffffffff8108e01a>] ? __queue_delayed_work+0xaa/0x1a0
[ 7677.703114] [<ffffffff8108e5f5>] ? try_to_grab_pending+0x65/0x80
[ 7677.703117] [<ffffffff817cebc7>] wait_for_completion+0xa7/0x160
[ 7677.703121] [<ffffffff810a3fa0>] ? try_to_wake_up+0x2a0/0x2a0
[ 7677.703126] [<ffffffff8121d6c6>] writeback_inodes_sb_nr+0x86/0xb0
[ 7677.703143] [<ffffffffc0630b9d>] shrink_delalloc+0x10d/0x300 [btrfs]
[ 7677.703159] [<ffffffffc0630e68>] flush_space+0xd8/0x150 [btrfs]
[ 7677.703173] [<ffffffffc0631763>] ? btrfs_async_reclaim_metadata_space+0x153/0x1d0 [btrfs]
[ 7677.703186] [<ffffffffc063175b>] btrfs_async_reclaim_metadata_space+0x14b/0x1d0 [btrfs]
[ 7677.703189] [<ffffffff8108f6dd>] process_one_work+0x14d/0x460
[ 7677.703192] [<ffffffff810900bb>] worker_thread+0x11b/0x3f0
[ 7677.703196] [<ffffffff8108ffa0>] ? create_worker+0x1e0/0x1e0
[ 7677.703199] [<ffffffff81095cc9>] kthread+0xc9/0xe0
[ 7677.703201] [<ffffffff81095c00>] ? flush_kthread_worker+0x90/0x90
[ 7677.703205] [<ffffffff817d17fc>] ret_from_fork+0x7c/0xb0
[ 7677.703208] [<ffffffff81095c00>] ? flush_kthread_worker+0x90/0x90

--
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1415510

Title:
  Frequent kernel panics when doing heavy I/O in LXC containers on Btrfs

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1415510/+subscriptions

--
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs