We've switched most of the instances in the cluster to the c4.large instance type, but another panic occurred. It happened at a different trace, so it may not be related to the first GPF, but I'm pasting the log below for information.
Also, the t2.small instance on 4.12.x has now been running for 16+ hours and we have not seen a panic yet. We are still keeping an eye on it.

[ 7236.612035] BUG: unable to handle kernel paging request at 000000010000000d
[ 7236.614750] IP: [<ffffffff81218247>] free_pipe_info+0x57/0x90
[ 7236.615155] PGD 0
[ 7236.615155] Oops: 0000 [#1] SMP
[ 7236.615155] Modules linked in: veth binfmt_misc xt_nat xt_comment xt_tcpudp ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 xt_addrtype iptable_filter ip_tables xt_conntrack x_tables nf_nat nf_conntrack br_netfilter bridge stp llc aufs isofs ppdev input_leds serio_raw parport_pc parport autofs4 btrfs xor raid6_pq crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel aes_x86_64 lrw gf128mul psmouse glue_helper ablk_helper cryptd ixgbevf floppy
[ 7236.615155] CPU: 0 PID: 1 Comm: systemd Not tainted 4.4.0-83-generic #106-Ubuntu
[ 7236.615155] Hardware name: Xen HVM domU, BIOS 4.2.amazon 02/16/2017
[ 7236.615155] task: ffff8800eabf0000 ti: ffff8800eabec000 task.ti: ffff8800eabec000
[ 7236.615155] RIP: 0010:[<ffffffff81218247>] [<ffffffff81218247>] free_pipe_info+0x57/0x90
[ 7236.615155] RSP: 0018:ffff8800eabefdf8 EFLAGS: 00010202
[ 7236.615155] RAX: 00000000fffffffd RBX: 0000000000000008 RCX: 000000000000012c
[ 7236.615155] RDX: 0000000000000028 RSI: ffff88005d5cc940 RDI: ffff8800e9840180
[ 7236.615155] RBP: ffff8800eabefe08 R08: 0000000000000000 R09: 0000000000000000
[ 7236.615155] R10: ffff8800e9e125b8 R11: ffff8800b5fc8510 R12: ffff8800e9840180
[ 7236.615155] R13: ffff8800e9e125b8 R14: ffff8800eab44c20 R15: ffff8800e9ee80c0
[ 7236.615155] FS: 00007fb3ddc0c8c0(0000) GS:ffff8800eb600000(0000) knlGS:0000000000000000
[ 7236.615155] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 7236.615155] CR2: 000000010000000d CR3: 000000003641f000 CR4: 00000000001406f0
[ 7236.615155] Stack:
[ 7236.615155] ffff8800e9e12640 ffff8800e9840180 ffff8800eabefe30 ffffffff812182dc
[ 7236.615155] ffff8800e9840180 ffff8800b5fc8500 ffff8800e9e125b8 ffff8800eabefe58
[ 7236.615155] ffffffff81218390 ffff8800b5fc8500 0000000000000008 ffff8800e9e125b8
[ 7236.615155] Call Trace:
[ 7236.615155] [<ffffffff812182dc>] put_pipe_info+0x5c/0x70
[ 7236.615155] [<ffffffff81218390>] pipe_release+0xa0/0xb0
[ 7236.615155] [<ffffffff81210f34>] __fput+0xe4/0x220
[ 7236.615155] [<ffffffff812110ae>] ____fput+0xe/0x10
[ 7236.615155] [<ffffffff8109f031>] task_work_run+0x81/0xa0
[ 7236.615155] [<ffffffff81003242>] exit_to_usermode_loop+0xc2/0xd0
[ 7236.615155] [<ffffffff81003c6e>] syscall_return_slowpath+0x4e/0x60
[ 7236.615155] [<ffffffff81840cd0>] int_ret_from_sys_call+0x25/0x8f
[ 7236.615155] Code: 4a e7 ff 41 8b 44 24 48 85 c0 74 2c 48 63 c3 48 8d 14 80 49 8b 84 24 80 00 00 00 48 8d 34 d0 48 8b 46 10 48 85 c0 74 06 4c 89 e7 <ff> 50 10 83 c3 01 41 39 5c 24 48 77 d4 49 8b 7c 24 68 48 85 ff
[ 7236.615155] RIP [<ffffffff81218247>] free_pipe_info+0x57/0x90
[ 7236.615155] RSP <ffff8800eabefdf8>
[ 7236.615155] CR2: 000000010000000d
[ 7236.723319] ---[ end trace aca2b9bb73327372 ]---
[ 7236.726141] Kernel panic - not syncing: Attempted to kill init! exitcode=0x00000009
[ 7236.726141]
[ 7236.729702] Kernel Offset: disabled

--
You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1702665

Title:
4.4.0-83-generic + Docker + EC2 t2.small frequently crashes at cgroup_rmdir GPF

Status in docker package in Ubuntu: New
Status in linux package in Ubuntu: Confirmed

Bug description:
We run xenial-based Docker container hosts on EC2 with Amazon ECS. After we recently refreshed our base image, we started to see frequent panics.

The hosts run the Amazon ECS Agent, which automatically creates or destroys Docker containers based on requests to the ECS cluster. I think this crash is caused by Docker-related operations, because it crashes in cgroup code.
Also, we run several other clusters on different EC2 instance types (c4.large and m4.*) using the same image. The problem reproduces only on t2.small instances.

Our previous image ran 4.4.0-79-generic, and we saw no problems with 79.

[30558.783899] general protection fault: 0000 [#1] SMP
[30558.784056] Modules linked in: veth binfmt_misc xt_nat xt_comment xt_tcpudp ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 xt_addrtype iptable_filter ip_tables xt_conntrack x_tables nf_nat nf_conntrack br_netfilter bridge stp llc isofs ppdev input_leds serio_raw parport_pc parport autofs4 btrfs xor raid6_pq crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel aes_x86_64 lrw gf128mul glue_helper ablk_helper cryptd psmouse floppy
[30558.784056] CPU: 0 PID: 1 Comm: systemd Not tainted 4.4.0-83-generic #106-Ubuntu
[30558.784056] Hardware name: Xen HVM domU, BIOS 4.2.amazon 02/16/2017
[30558.784056] task: ffff88007c4e8000 ti: ffff88007c4e4000 task.ti: ffff88007c4e4000
[30558.784056] RIP: 0010:[<ffffffff811193ff>] [<ffffffff811193ff>] cgroup_destroy_locked+0x5f/0xf0
[30558.784056] RSP: 0018:ffff88007c4e7e40 EFLAGS: 00010212
[30558.784056] RAX: ffff8800114481bd RBX: ffff88002827ba50 RCX: ffff88007ab8d150
[30558.784056] RDX: 00111e7e00ffff88 RSI: ffff88002827ba54 RDI: ffffffff8217745c
[30558.784056] RBP: ffff88007c4e7e60 R08: 0000000000000020 R09: ffff88007c4e7e70
[30558.784056] R10: 000000000637760b R11: ffff880011829a80 R12: ffff88007ab8d000
[30558.784056] R13: 0000000000000000 R14: 0000559b48b2dcc0 R15: 00000000ffffff9c
[30558.784056] FS: 00007f29cf0db8c0(0000) GS:ffff88007d200000(0000) knlGS:0000000000000000
[30558.784056] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[30558.784056] CR2: 00007fa466d58180 CR3: 000000007c190000 CR4: 00000000001406f0
[30558.784056] Stack:
[30558.784056] ffff88002827ba50 ffff88002827ba50 ffff8800373e70d0 0000559b48b2dcc0
[30558.784056] ffff88007c4e7e80 ffffffff811194b3 ffff88002827ba50 0000000000000000
[30558.784056] ffff88007c4e7ea0 ffffffff8128ddcd ffff880011829a80 0000000000000000
[30558.784056] Call Trace:
[30558.784056] [<ffffffff811194b3>] cgroup_rmdir+0x23/0x40
[30558.784056] [<ffffffff8128ddcd>] kernfs_iop_rmdir+0x4d/0x80
[30558.784056] [<ffffffff8121b134>] vfs_rmdir+0xb4/0x130
[30558.784056] [<ffffffff8121f83f>] do_rmdir+0x1df/0x200
[30558.784056] [<ffffffff81220546>] SyS_rmdir+0x16/0x20
[30558.784056] [<ffffffff81840b72>] entry_SYSCALL_64_fastpath+0x16/0x71
[30558.784056] Code: 74 fd 48 c7 c7 5c 74 17 82 e8 8e 72 72 00 49 8b 94 24 50 01 00 00 49 8d 8c 24 50 01 00 00 48 39 d1 48 8d 42 f0 74 18 48 8b 50 08 <c6> 82 b0 01 00 00 01 48 8b 50 10 48 39 d1 48 8d 42 f0 75 e8 49
[30558.784056] RIP [<ffffffff811193ff>] cgroup_destroy_locked+0x5f/0xf0
[30558.784056] RSP <ffff88007c4e7e40>
[30558.960828] ---[ end trace 7634e03ff94e8934 ]---
[30558.964811] Kernel panic - not syncing: Fatal exception in interrupt
[30558.968805] Kernel Offset: disabled

ProblemType: Bug
DistroRelease: Ubuntu 16.04
Package: linux-image-4.4.0-83-generic 4.4.0-83.106
ProcVersionSignature: Ubuntu 4.4.0-83.106-generic 4.4.70
Uname: Linux 4.4.0-83-generic x86_64
AlsaDevices:
 total 0
 crw-rw---- 1 root audio 116, 1 Jul 6 10:22 seq
 crw-rw---- 1 root audio 116, 33 Jul 6 10:22 timer
AplayDevices: Error: [Errno 2] No such file or directory: 'aplay'
ApportVersion: 2.20.1-0ubuntu2.6
Architecture: amd64
ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord'
AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1:
CRDA: N/A
Date: Thu Jul 6 10:27:37 2017
Ec2AMI: ami-34100353
Ec2AMIManifest: (unknown)
Ec2AvailabilityZone: ap-northeast-1c
Ec2InstanceType: t2.small
Ec2Kernel: unavailable
Ec2Ramdisk: unavailable
IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig'
Lsusb: Error: command ['lsusb'] failed with exit code 1:
MachineType: Xen HVM domU
PciMultimedia:
ProcEnviron:
 SHELL=/bin/bash
 TERM=screen-256color
 PATH=(custom, no user)
 LANG=en_US.UTF-8
ProcFB:
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-4.4.0-83-generic root=UUID=f76be987-234f-4071-87d4-06318cfc2135 ro cgroup_enable=memory swapaccount=1 console=tty1 console=ttyS0
RelatedPackageVersions:
 linux-restricted-modules-4.4.0-83-generic N/A
 linux-backports-modules-4.4.0-83-generic N/A
 linux-firmware N/A
RfKill: Error: [Errno 2] No such file or directory: 'rfkill'
SourcePackage: linux
UpgradeStatus: No upgrade log present (probably fresh install)
WifiSyslog:
dmi.bios.date: 02/16/2017
dmi.bios.vendor: Xen
dmi.bios.version: 4.2.amazon
dmi.chassis.type: 1
dmi.chassis.vendor: Xen
dmi.modalias: dmi:bvnXen:bvr4.2.amazon:bd02/16/2017:svnXen:pnHVMdomU:pvr4.2.amazon:cvnXen:ct1:cvr:
dmi.product.name: HVM domU
dmi.product.version: 4.2.amazon
dmi.sys.vendor: Xen
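
To compare the two panics side by side, the faulting symbol and call chain can be pulled out of each oops mechanically. The following is only a minimal sketch, assuming the console output has been saved as a plain string; the function name summarize_oops and both regular expressions are hypothetical helpers written for this illustration, not part of any kernel or Ubuntu tooling.

```python
import re

# Symbol+offset as printed by the 4.4 oops format, e.g. "free_pipe_info+0x57/0x90".
SYMBOL = r"([\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"

# Summary line near the end of the oops: "RIP [<addr>] symbol+off/len".
# The register-dump line "RIP: 0010:[<addr>] ..." has a colon and is skipped.
RIP_RE = re.compile(r"RIP\s+\[<[0-9a-f]+>\]\s+" + SYMBOL)

# One call-trace entry: "[<addr>] symbol+off/len".
FRAME_RE = re.compile(r"\[<[0-9a-f]+>\]\s+" + SYMBOL)


def summarize_oops(text):
    """Return the faulting function and the call-trace function names."""
    rip = RIP_RE.search(text)
    frames = []
    start = text.find("Call Trace:")
    if start != -1:
        tail = text[start:]
        # The trace ends where the "Code:" byte dump begins.
        end = tail.find("Code:")
        if end != -1:
            tail = tail[:end]
        frames = FRAME_RE.findall(tail)
    return {"rip": rip.group(1) if rip else None, "frames": frames}


if __name__ == "__main__":
    # Abbreviated excerpt of the second panic above.
    sample = (
        "[ 7236.615155] Call Trace: "
        "[ 7236.615155] [<ffffffff812182dc>] put_pipe_info+0x5c/0x70 "
        "[ 7236.615155] [<ffffffff81218390>] pipe_release+0xa0/0xb0 "
        "[ 7236.615155] Code: 4a e7 ff "
        "[ 7236.615155] RIP [<ffffffff81218247>] free_pipe_info+0x57/0x90"
    )
    print(summarize_oops(sample))
```

Run over both logs, this would give free_pipe_info (reached via pipe_release) for the c4.large panic and cgroup_destroy_locked (reached via cgroup_rmdir) for the original t2.small panic, which is the difference in traces noted at the top of this message.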