That one hung much like the others: the system responded only to very
basic commands (mostly bash builtins), although this time the dmesg
output continued past the kernel BUG message. Here's the dmesg output:

[  107.652875] EDAC MC0: Giving out device to module sb_edac.c controller Haswell SrcID#1_Ha#0: DEV 0000:ff:12.0 (INTERRUPT)
[  107.652995] EDAC MC1: Giving out device to module sb_edac.c controller Haswell SrcID#0_Ha#0: DEV 0000:7f:12.0 (INTERRUPT)
[  107.653010] EDAC sbridge: Some needed devices are missing
[  107.675559] EDAC MC: Removed device 0 for sb_edac.c Haswell SrcID#1_Ha#0: DEV 0000:ff:12.0
[  107.703606] EDAC MC: Removed device 1 for sb_edac.c Haswell SrcID#0_Ha#0: DEV 0000:7f:12.0
[  107.703639] EDAC sbridge: Couldn't find mci handler
[  107.704195] EDAC sbridge: Couldn't find mci handler
[  107.704618] EDAC sbridge: Failed to register device with error -19.
[  108.163612] smpboot: Booting Node 1 Processor 8 APIC 0x10
[  108.189804] intel_rapl: Found RAPL domain package
[  108.189810] intel_rapl: Found RAPL domain dram
[  108.189812] intel_rapl: DRAM domain energy unit 15300pj
[  108.190389] ------------[ cut here ]------------
[  108.190390] kernel BUG at /home/jsalisbury/bugs/lp1733662/ubuntu-artful/mm/slub.c:3878!
[  108.191016] invalid opcode: 0000 [#1] SMP
[  108.191511] Modules linked in: nls_iso8859_1 intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp ipmi_ssif kvm_intel kvm input_leds irqbypass joydev mei_me intel_cstate ipmi_si intel_rapl_perf shpchp acpi_power_meter ipmi_devintf ipmi_msghandler mei lpc_ich mac_hid acpi_pad ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi autofs4 btrfs raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear ses enclosure scsi_transport_sas crct10dif_pclmul crc32_pclmul mgag200 ghash_clmulni_intel ttm pcbc fnic hid_generic drm_kms_helper igb syscopyarea aesni_intel usbhid dca sysfillrect i2c_algo_bit sysimgblt aes_x86_64 ptp fb_sys_fops crypto_simd mxm_wmi hid libfcoe glue_helper ahci cryptd libfc drm
[  108.195174]  libahci pps_core enic scsi_transport_fc megaraid_sas wmi
[  108.195756] CPU: 8 PID: 302 Comm: kworker/8:3 Not tainted 4.13.0-13-generic #14~lp1733662Commit97327adfdaf5d
[  108.196353] Hardware name: Cisco Systems Inc UCSC-C240-M4L/UCSC-C240-M4L, BIOS C240M4.2.0.10c.0.032320160820 03/23/2016
[  108.196971] Workqueue: events cpuset_hotplug_workfn
[  108.197583] task: ffff8e3432fcae80 task.stack: ffffb5fb4e104000
[  108.198236] RIP: 0010:kfree+0x11c/0x160
[  108.198861] RSP: 0000:ffffb5fb4e107cc8 EFLAGS: 00010246
[  108.199485] RAX: fffffb0ffeff0020 RBX: ffff8e3400000000 RCX: 000000018020001d
[  108.200121] RDX: 0000000000000000 RSI: fffffb0fffd33600 RDI: 0000720b40000000
[  108.200764] RBP: ffffb5fb4e107ce0 R08: ffff8e3434cd8c00 R09: 000000018020001d
[  108.201405] R10: fffffb0fff000000 R11: 0000000000000000 R12: ffff8e343254f058
[  108.202053] R13: ffffffff876ce3d3 R14: ffff8e34382b6d10 R15: 0000000000000000
[  108.202703] FS:  0000000000000000(0000) GS:ffff8e343f200000(0000) knlGS:0000000000000000
[  108.203367] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  108.204031] CR2: 0000000000000000 CR3: 00000026c7609000 CR4: 00000000001406e0
[  108.204702] Call Trace:
[  108.205377]  sd_free_ctl_entry+0x63/0x70
[  108.206054]  sd_free_ctl_entry+0x53/0x70
[  108.206727]  unregister_sched_domain_sysctl+0x36/0x40
[  108.207396]  partition_sched_domains+0x34/0x2f0
[  108.208070]  rebuild_sched_domains_locked+0x5a/0x80
[  108.208771]  rebuild_sched_domains+0x1a/0x30
[  108.209442]  cpuset_hotplug_workfn+0x1b1/0xd30
[  108.210119]  ? mutex_lock+0x12/0x40
[  108.210790]  process_one_work+0x1e7/0x410
[  108.211470]  worker_thread+0x4a/0x410
[  108.212146]  kthread+0x125/0x140
[  108.212814]  ? process_one_work+0x410/0x410
[  108.213482]  ? kthread_create_on_node+0x70/0x70
[  108.214159]  ret_from_fork+0x25/0x30
[  108.214830] Code: 08 49 83 c4 18 48 89 da 4c 89 ee ff d0 49 8b 04 24 48 85 c0 75 e6 e9 0e ff ff ff 49 8b 02 f6 c4 80 75 0a 49 8b 42 20 a8 01 75 02 <0f> 0b 49 8b 02 31 f6 f6 c4 80 74 04 41 8b 72 6c 4c 89 d7 e8 2c
[  108.216269] RIP: kfree+0x11c/0x160 RSP: ffffb5fb4e107cc8
[  108.217042] ---[ end trace 8c27258fb7e406c8 ]---
[  108.225116] EDAC sbridge: Seeking for: PCI ID 8086:2fa0
[  108.225880] EDAC sbridge: Seeking for: PCI ID 8086:2fa0
[  108.226442] EDAC sbridge: Seeking for: PCI ID 8086:2fa0
[  108.227006] EDAC sbridge: Seeking for: PCI ID 8086:2f60
[  108.227623] EDAC sbridge: Seeking for: PCI ID 8086:2fa8
[  108.228288] EDAC sbridge: Seeking for: PCI ID 8086:2fa8
[  108.228809] EDAC sbridge: Seeking for: PCI ID 8086:2fa8
[  108.229313] EDAC sbridge: Seeking for: PCI ID 8086:2f71
[  108.229794] EDAC sbridge: Seeking for: PCI ID 8086:2f71
[  108.230253] EDAC sbridge: Seeking for: PCI ID 8086:2f71
[  108.230692] EDAC sbridge: Seeking for: PCI ID 8086:2faa
[  108.231110] EDAC sbridge: Seeking for: PCI ID 8086:2faa
[  108.231552] EDAC sbridge: Seeking for: PCI ID 8086:2faa
[  108.232040] EDAC sbridge: Seeking for: PCI ID 8086:2fab
[  108.232421] EDAC sbridge: Seeking for: PCI ID 8086:2fab
[  108.232770] EDAC sbridge: Seeking for: PCI ID 8086:2fab
[  108.233103] EDAC sbridge: Seeking for: PCI ID 8086:2fac
[  108.233415] EDAC sbridge: Seeking for: PCI ID 8086:2fac
[  108.233723] EDAC sbridge: Seeking for: PCI ID 8086:2fac
[  108.234021] EDAC sbridge: Seeking for: PCI ID 8086:2fad
[  108.234314] EDAC sbridge: Seeking for: PCI ID 8086:2fad
[  108.234597] EDAC sbridge: Seeking for: PCI ID 8086:2fad
[  108.234872] EDAC sbridge: Seeking for: PCI ID 8086:2f68
[  108.235140] EDAC sbridge: Seeking for: PCI ID 8086:2f68
[  108.235397] EDAC sbridge: Seeking for: PCI ID 8086:2f68
[  108.235697] EDAC sbridge: Seeking for: PCI ID 8086:2f79
[  108.236003] EDAC sbridge: Seeking for: PCI ID 8086:2f6a
[  108.236293] EDAC sbridge: Seeking for: PCI ID 8086:2f6b
[  108.236529] EDAC sbridge: Seeking for: PCI ID 8086:2f6c
[  108.236752] EDAC sbridge: Seeking for: PCI ID 8086:2f6d
[  108.236971] EDAC sbridge: Seeking for: PCI ID 8086:2ffc
[  108.237183] EDAC sbridge: Seeking for: PCI ID 8086:2ffc
[  108.237390] EDAC sbridge: Seeking for: PCI ID 8086:2ffc
[  108.237589] EDAC sbridge: Seeking for: PCI ID 8086:2ffd
[  108.237798] EDAC sbridge: Seeking for: PCI ID 8086:2ffd
[  108.237987] EDAC sbridge: Seeking for: PCI ID 8086:2ffd
[  108.238172] EDAC sbridge: Seeking for: PCI ID 8086:2fbd
[  108.238349] EDAC sbridge: Seeking for: PCI ID 8086:2fbd
[  108.238520] EDAC sbridge: Seeking for: PCI ID 8086:2fbd
[  108.238689] EDAC sbridge: Seeking for: PCI ID 8086:2fbf
[  108.238852] EDAC sbridge: Seeking for: PCI ID 8086:2fbf
[  108.239011] EDAC sbridge: Seeking for: PCI ID 8086:2fbf
[  108.239164] EDAC sbridge: Seeking for: PCI ID 8086:2fb9
[  108.239309] EDAC sbridge: Seeking for: PCI ID 8086:2fb9
[  108.239439] EDAC sbridge: Seeking for: PCI ID 8086:2fb9
[  108.239583] EDAC sbridge: Seeking for: PCI ID 8086:2fbb
[  108.239690] EDAC sbridge: Seeking for: PCI ID 8086:2fbb
[  108.239799] EDAC sbridge: Seeking for: PCI ID 8086:2fbb
[  108.239998] EDAC MC0: Giving out device to module sb_edac.c controller Haswell SrcID#1_Ha#0: DEV 0000:ff:12.0 (INTERRUPT)
[  108.240338] EDAC MC1: Giving out device to module sb_edac.c controller Haswell SrcID#0_Ha#0: DEV 0000:7f:12.0 (INTERRUPT)
[  108.240473] EDAC sbridge: Some needed devices are missing
[  108.267599] EDAC MC: Removed device 0 for sb_edac.c Haswell SrcID#1_Ha#0: DEV 0000:ff:12.0
[  108.303631] EDAC MC: Removed device 1 for sb_edac.c Haswell SrcID#0_Ha#0: DEV 0000:7f:12.0
[  108.304152] EDAC sbridge: Couldn't find mci handler
[  108.304369] EDAC sbridge: Couldn't find mci handler
[  108.304577] EDAC sbridge: Failed to register device with error -19.

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1733662

Title:
  System hang with Linux kernel 4.13, not with 4.10

Status in linux package in Ubuntu:
  In Progress
Status in linux-hwe package in Ubuntu:
  New
Status in linux source package in Artful:
  In Progress
Status in linux-hwe source package in Artful:
  New
Status in linux source package in Bionic:
  In Progress
Status in linux-hwe source package in Bionic:
  New

Bug description:
  In doing Ubuntu 17.10 regression testing, we've encountered one
  computer (boldore, a Cisco UCS C240 M4 [VIC]) that hangs about one in
  four times when running our cpu_offlining test. This test takes all
  the CPU cores offline except one, then brings them back online. The
  test ran successfully on boldore with previous releases, but with
  17.10 the system hangs in about one run in four. Reverting to Ubuntu
  16.04.3, I found no problems; but when I upgraded the 16.04.3
  installation to linux-image-4.13.0-16-generic, the problem appeared
  again, so I'm confident this is a problem with the kernel. I'm
  attaching two files, dmesg-output-4.10.txt and dmesg-output-4.13.txt,
  which show the dmesg output that appears when running the
  cpu_offlining test with the 4.10.0-38 and 4.13.0-16 kernels,
  respectively; the system hung on the 4.13 run. (I was running
  "dmesg -w" in a second SSH login; the files are cut-and-pasted from
  that.)
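
For readers unfamiliar with CPU hotplug, the following is a rough sketch of what a test like cpu_offlining might do. It is not the actual test script, which is not shown in this report. It assumes the standard Linux sysfs interface (/sys/devices/system/cpu/cpuN/online) and assumes CPU 0 is the core left online; the function names are made up for illustration. Running it against the real sysfs tree requires root.

```python
import glob
import os
import re

# Standard sysfs location for CPU hotplug controls on Linux.
SYSFS_CPU_ROOT = "/sys/devices/system/cpu"


def hotpluggable_cpus(root=SYSFS_CPU_ROOT):
    """Return sorted CPU numbers, excluding CPU 0, that have an 'online' file."""
    cpus = []
    for path in glob.glob(os.path.join(root, "cpu[0-9]*")):
        match = re.fullmatch(r"cpu(\d+)", os.path.basename(path))
        if not match:
            continue
        cpu = int(match.group(1))
        # Assumption: CPU 0 is the one core that stays online.
        if cpu != 0 and os.path.exists(os.path.join(path, "online")):
            cpus.append(cpu)
    return sorted(cpus)


def set_online(cpu, online, root=SYSFS_CPU_ROOT):
    """Write 1 (online) or 0 (offline) to the CPU's sysfs control file."""
    with open(os.path.join(root, f"cpu{cpu}", "online"), "w") as f:
        f.write("1" if online else "0")


def cycle_cpus(root=SYSFS_CPU_ROOT):
    """Take every hotpluggable CPU offline, then bring each back online."""
    cpus = hotpluggable_cpus(root)
    for cpu in cpus:
        set_online(cpu, False, root)
    for cpu in cpus:
        set_online(cpu, True, root)
    return cpus
```

Calling cycle_cpus() as root while "dmesg -w" runs in a second login would approximate the setup described above, though the real test may differ in ordering and timing.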

  I initiated this bug report from an Ubuntu 16.04.3 installation
  running a 4.10 kernel, but as noted above, the bug applies to the
  4.13 kernel.

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.10.0-38-generic 4.10.0-38.42~16.04.1
  ProcVersionSignature: Ubuntu 4.10.0-38.42~16.04.1-generic 4.10.17
  Uname: Linux 4.10.0-38-generic x86_64
  ApportVersion: 2.20.1-0ubuntu2.10
  Architecture: amd64
  Date: Tue Nov 21 17:36:06 2017
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=<set>
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  SourcePackage: linux-hwe
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1733662/+subscriptions
