[Kernel-packages] [Bug 1635597] Comment bridged from LTC Bugzilla

bugproxy Wed, 13 Sep 2017 10:32:30 -0700

------- Comment From lekshmi.cpil...@in.ibm.com 2017-09-13 13:18 EDT-------
Hi,

Today I tested kdump with Ubuntu 16.10 on talclp3.
Access info:
HMC: hmc-lte2.isst.aus.stglabs.ibm.com   (hscroot/abc123)

Console Access: rmvterm -m talc -p talclp3;mkvterm -m talc -p talclp3;

Logs:

root@talclp3:~# echo c > /proc/sysrq-trigger
[  424.180480] sysrq: SysRq : Trigger a crash
[  424.180497] Unable to handle kernel paging request for data at address 
0x00000000
[  424.180500] Faulting instruction address: 0xc0000000006a2428
[  424.180504] Oops: Kernel access of bad area, sig: 11 [#1]
[  424.180506] SMP NR_CPUS=2048 NUMA pSeries
[  424.180509] Modules linked in: nfsv3 nfs_acl rpcsec_gss_krb5 auth_rpcgss 
nfsv4 nfs lockd grace fscache rdma_ucm(OE) ib_ucm(OE) rdma_cm(OE) iw_cm(OE) 
configfs ib_ipoib(OE) ib_cm(OE) ib_uverbs(OE) ib_umad(OE) mlx5_ib(OE) 
mlx5_core(OE) mlx4_ib(OE) pseries_rng ib_core(OE) vmx_crypto binfmt_misc 
dm_round_robin sunrpc dm_multipath knem(OE) ip_tables x_tables autofs4 btrfs 
xor raid6_pq mlx4_en(OE) ibmvfc scsi_transport_fc ibmvscsi bnx2x mlx4_core(OE) 
devlink mlx_compat(OE)
mdio libcrc32c be2net crc32c_vpmsum
[  424.180541] CPU: 0 PID: 2733 Comm: bash Tainted: G           OE   
4.8.0-59-generic #64-Ubuntu
[  424.180545] task: c0000000b3d78600 task.stack: c0000000a2104000
[  424.180547] NIP: c0000000006a2428 LR: c0000000006a3478 CTR: c0000000006a2400
[  424.180550] REGS: c0000000a21079f0 TRAP: 0300   Tainted: G           OE    
(4.8.0-59-generic)
[  424.180553] MSR: 8000000000009033 <SF,EE,ME,IR,DR,RI,LE>  CR: 28222222  XER: 
00000001
[  424.180560] CFAR: c000000000008750 DAR: 0000000000000000 DSISR: 42000000 
SOFTE: 1
GPR00: c0000000006a3478 c0000000a2107c70 c000000001467500 0000000000000063
GPR04: c0000000bd00aca0 c0000000bd01fb40 c00000017fd2e300 000000000000b240
GPR08: 0000000000000007 0000000000000001 0000000000000000 0000000000000001
GPR12: c0000000006a2400 c000000007b30000 0000000000000000 0000000022000000
GPR16: 0000000010170dc8 000001000df90258 0000000010140528 00000000100c6f60
GPR20: 0000000000000000 000000001017dd58 0000000010152bf0 000000001017b608
GPR24: 00003ffff97be144 00003ffff97be140 c00000000137e6e0 0000000000000004
GPR28: c00000000137eaa0 0000000000000063 c000000001332590 0000000000000000
[  424.180599] NIP [c0000000006a2428] sysrq_handle_crash+0x28/0x30
[  424.180602] LR [c0000000006a3478] __handle_sysrq+0xe8/0x280
[  424.180604] Call Trace:
[  424.180606] [c0000000a2107c70] [c0000000006a3458] __handle_sysrq+0xc8/0x280 
(unreliable)
[  424.180610] [c0000000a2107d10] [c0000000006a3bcc] 
write_sysrq_trigger+0x6c/0x90
[  424.180614] [c0000000a2107d40] [c0000000003adb48] proc_reg_write+0x88/0xd0
[  424.180619] [c0000000a2107d70] [c0000000003105ac] __vfs_write+0x3c/0x70
[  424.180622] [c0000000a2107d90] [c000000000311814] vfs_write+0xd4/0x240
[  424.180625] [c0000000a2107de0] [c000000000313368] SyS_write+0x68/0x110
[  424.180629] [c0000000a2107e30] [c000000000009584] system_call+0x38/0xec
[  424.180631] Instruction dump:
[  424.180633] 60000000 60000000 3c4c00dc 38425100 7c0802a6 60000000 3d22001a 
3949bc60
[  424.180639] 39200001 912a0000 7c0004ac 39400000 <992a0000> 4e800020 3c4c00dc 
384250d0
[  424.180645] ---[ end trace 8fd1cd00c31ebdd4 ]---
[  424.183431]
[  424.183450] Sending IPI to other CPUs
[  424.183452] IPI complete
I'm in purgatory
-> smp_release_cpus()
spinning_secondaries = 47
<- smp_release_cpus()
[    0.184530] pci 002b:50:00.0: of_irq_parse_pci() failed with rc=-22
[    0.569039] Kernel panic - not syncing: Out of memory and no killable processes...
[    0.569039]
[    0.569066] CPU: 0 PID: 1 Comm: swapper/0 Not tainted 4.8.0-59-generic #64-Ubuntu
[    0.569069] Call Trace:
[    0.569071] [c00000000d10b220] [c000000008b0fe4c] dump_stack+0xb0/0xf0 
(unreliable)
[    0.569075] [c00000000d10b260] [c000000008b0bf58] panic+0x144/0x308
[    0.569078] [c00000000d10b2f0] [c000000008249c2c] out_of_memory+0x48c/0x570
[    0.569082] [c00000000d10b3a0] [c000000008250ad8] 
__alloc_pages_nodemask+0xdf8/0xe20
[    0.569086] [c00000000d10b560] [c0000000082c6da8] 
alloc_page_interleave+0x58/0xc0
[    0.569089] [c00000000d10b5a0] [c0000000082c7678] 
alloc_pages_current+0x168/0x1d0
[    0.569093] [c00000000d10b600] [c0000000082435e8] 
__page_cache_alloc+0x118/0x160
[    0.569096] [c00000000d10b640] [c0000000082437b4] 
pagecache_get_page+0x184/0x3c0
[    0.569100] [c00000000d10b6b0] [c000000008243a34] 
grab_cache_page_write_begin+0x44/0x70
[    0.569103] [c00000000d10b6e0] [c00000000834bf6c] 
simple_write_begin+0x4c/0x1b0
[    0.569107] [c00000000d10b730] [c000000008243264] 
generic_perform_write+0x104/0x280
[    0.569111] [c00000000d10b7d0] [c000000008245540] 
__generic_file_write_iter+0x1e0/0x230
[    0.569114] [c00000000d10b830] [c00000000824567c] 
generic_file_write_iter+0xec/0x250
[    0.569118] [c00000000d10b870] [c00000000831050c] new_sync_write+0xec/0x150
[    0.569121] [c00000000d10b900] [c000000008311814] vfs_write+0xd4/0x240
[    0.569124] [c00000000d10b950] [c000000008313368] SyS_write+0x68/0x110
[    0.569127] [c00000000d10b9a0] [c000000008ea5d0c] xwrite+0x4c/0xb0
[    0.569130] [c00000000d10b9e0] [c000000008ea5e60] do_copy+0xf0/0x170
[    0.569133] [c00000000d10ba10] [c000000008ea59c4] write_buffer+0x5c/0x88
[    0.569136] [c00000000d10ba40] [c000000008ea5a50] flush_buffer+0x60/0xec
[    0.569140] [c00000000d10ba90] [c000000008eec4c8] __gunzip+0x378/0x47c
[    0.569142] [c00000000d10bb10] [c000000008ea650c] 
unpack_to_rootfs+0x1c8/0x338
[    0.569146] [c00000000d10bbc0] [c000000008ea688c] populate_rootfs+0x94/0x17c
[    0.569149] [c00000000d10bc40] [c00000000800b948] do_one_initcall+0x68/0x1d0
[    0.569152] [c00000000d10bd00] [c000000008ea42e8] 
kernel_init_freeable+0x278/0x360
[    0.569156] [c00000000d10bdc0] [c00000000800c1b4] kernel_init+0x24/0x170
[    0.569159] [c00000000d10be30] [c0000000080098f0] 
ret_from_kernel_thread+0x5c/0x6c
[    0.571060] ---[ end Kernel panic - not syncing: Out of memory and no killable processes...
[    0.571060]

root@talclp3:~# service kdump-tools status
* kdump-tools.service - Kernel crash dump capture service
Loaded: loaded (/lib/systemd/system/kdump-tools.service; enabled; vendor pres
Active: active (exited) since Wed 2017-09-13 12:02:16 CDT; 3min 28s ago
Main PID: 2281 (code=exited, status=0/SUCCESS)
Tasks: 0 (limit: 9830)
CGroup: /system.slice/kdump-tools.service

Sep 13 12:02:14 talclp3 systemd[1]: Starting Kernel crash dump capture service..
Sep 13 12:02:15 talclp3 kdump-tools[2281]: Starting kdump-tools: Modified cmdlin
Sep 13 12:02:16 talclp3 kdump-tools[2281]:  * loaded kdump kernel
Sep 13 12:02:16 talclp3 kdump-tools[2581]: /sbin/kexec -p --command-line="BOOT_I
Sep 13 12:02:16 talclp3 kdump-tools[2582]: loaded kdump kernel
Sep 13 12:02:16 talclp3 systemd[1]: Started Kernel crash dump capture service.
root@talclp3:~#
root@talclp3:~# uname -a
Linux talclp3 4.8.0-59-generic #64-Ubuntu SMP Thu Jun 29 19:36:04 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux
root@talclp3:~# uname -r
4.8.0-59-generic
root@talclp3:~# cat /etc/lsb-release
DISTRIB_ID=Ubuntu
DISTRIB_RELEASE=16.10
DISTRIB_CODENAME=yakkety
DISTRIB_DESCRIPTION="Ubuntu 16.10"
root@talclp3:~# cat /proc/cmdline
BOOT_IMAGE=/boot/vmlinux-4.8.0-59-generic root=UUID=30629c5d-7ff0-48db-b2ca-7c2255d0fa18 ro splash quiet crashkernel=2G-4G:320M,4G-32G:512M,32G-64G:1024M,64G-128G:2048M,128G-:4096M@32M maxcpus=1 crashkernel=384M-:128M
root@talclp3:~# kdump-config show
DUMP_MODE:        kdump
USE_KDUMP:        1
KDUMP_SYSCTL:     kernel.panic_on_oops=1
KDUMP_COREDIR:    /var/crash
crashkernel addr:
/var/lib/kdump/vmlinuz: symbolic link to /boot/vmlinux-4.8.0-59-generic
kdump initrd:
/var/lib/kdump/initrd.img: symbolic link to /var/lib/kdump/initrd.img-4.8.0-59-generic
current state:    ready to kdump

kexec command:
/sbin/kexec -p --command-line="BOOT_IMAGE=/boot/vmlinux-4.8.0-59-generic root=UUID=30629c5d-7ff0-48db-b2ca-7c2255d0fa18 ro splash quiet maxcpus=1 irqpoll nr_cpus=1 nousb systemd.unit=kdump-tools.service" --initrd=/var/lib/kdump/initrd.img /var/lib/kdump/vmlinuz
root@talclp3:~# apt list --installed|grep makedumpfile

makedumpfile/yakkety-updates,now 1:1.6.0-2ubuntu1.2 ppc64el [installed,automatic]
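
The command line above carries two crashkernel= entries, and the kdump kernel ran out of memory while unpacking its initramfs, so it may be worth confirming how much memory was actually reserved. A minimal sketch for that check follows; list_crashkernel is a hypothetical helper name used only for illustration, not part of kdump-tools.

```shell
# Print every crashkernel= parameter found in a kernel command line,
# one per line, in the order they appear.
# list_crashkernel is a hypothetical helper, not part of kdump-tools.
list_crashkernel() {
    printf '%s\n' "$1" | tr ' ' '\n' | grep '^crashkernel='
}

# Usage on a live system:
#   list_crashkernel "$(cat /proc/cmdline)"
#   cat /sys/kernel/kexec_crash_size   # bytes currently reserved for the crash kernel
```

Comparing the listed entries with the size reported in /sys/kernel/kexec_crash_size shows which reservation is in effect.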

Thanks
Lekshmi

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to makedumpfile in Ubuntu.
https://bugs.launchpad.net/bugs/1635597

Title:
  Ubuntu16.10:talclp1: Kdump failed with multipath disk

Status in The Ubuntu-power-systems project:
  Incomplete
Status in linux package in Ubuntu:
  New
Status in makedumpfile package in Ubuntu:
  Fix Released
Status in linux source package in Trusty:
  New
Status in makedumpfile source package in Trusty:
  Confirmed
Status in linux source package in Xenial:
  New
Status in makedumpfile source package in Xenial:
  Confirmed
Status in linux source package in Zesty:
  New
Status in makedumpfile source package in Zesty:
  Confirmed

Bug description:
  Problem  Description
  ==========================
  On talclp1, I enabled kdump, but kdump failed and the system dropped to a BusyBox shell.

  root@talclp1:~# echo c> /proc/sysrq-trigger
  [  132.643690] sysrq: SysRq : Trigger a crash
  [  132.643739] Unable to handle kernel paging request for data at address 
0x00000000
  [  132.643745] Faulting instruction address: 0xc0000000005c28f4
  [  132.643749] Oops: Kernel access of bad area, sig: 11 [#1]
  [  132.643753] SMP NR_CPUS=2048 NUMA pSeries
  [  132.643758] Modules linked in: fuse ufs qnx4 hfsplus hfs minix ntfs msdos 
jfs rpadlpar_io rpaphp rpcsec_gss_krb5 nfsv4 dccp_diag cifs nfs dns_resolver 
dccp tcp_diag fscache udp_diag inet_diag unix_diag af_packet_diag netlink_diag 
binfmt_misc xfs libcrc32c pseries_rng rng_core ghash_generic gf128mul 
vmx_crypto sg nfsd auth_rpcgss nfs_acl lockd grace sunrpc ip_tables x_tables 
autofs4 ext4 crc16 jbd2 fscrypto mbcache crc32c_generic btrfs xor raid6_pq 
dm_round_robin sr_mod sd_mod cdrom ses enclosure scsi_transport_sas ibmveth 
crc32c_vpmsum ipr scsi_dh_emc scsi_dh_rdac scsi_dh_alua dm_multipath dm_mod
  [  132.643819] CPU: 49 PID: 10174 Comm: bash Not tainted 4.8.0-15-generic 
#16-Ubuntu
  [  132.643824] task: c000000111767080 task.stack: c0000000d82e0000
  [  132.643828] NIP: c0000000005c28f4 LR: c0000000005c39d8 CTR: 
c0000000005c28c0
  [  132.643832] REGS: c0000000d82e3990 TRAP: 0300   Not tainted  
(4.8.0-15-generic)
  [  132.643836] MSR: 8000000000009033 <SF,EE,ME,IR,DR,RI,LE>  CR: 28242422  
XER: 00000001
  [  132.643848] CFAR: c0000000000087d0 DAR: 0000000000000000 DSISR: 42000000 
SOFTE: 1
  GPR00: c0000000005c39d8 c0000000d82e3c10 c000000000f67b00 0000000000000063
  GPR04: c00000011d04a9b8 c00000011d05f7e0 c00000047fb00000 0000000000015998
  GPR08: 0000000000000007 0000000000000001 0000000000000000 0000000000000001
  GPR12: c0000000005c28c0 c000000007b4b900 ffffffffffffffff 0000000022000000
  GPR16: 0000000010170dc8 000001002b566368 0000000010140f58 00000000100c7570
  GPR20: 0000000000000000 000000001017dd58 0000000010153618 000000001017b608
  GPR24: 00003ffffe87a294 0000000000000001 c000000000ebff60 0000000000000004
  GPR28: c000000000ec0320 0000000000000063 c000000000e72a90 0000000000000000
  [  132.643906] NIP [c0000000005c28f4] sysrq_handle_crash+0x34/0x50
  [  132.643911] LR [c0000000005c39d8] __handle_sysrq+0xe8/0x280
  [  132.643914] Call Trace:
  [  132.643917] [c0000000d82e3c10] [c000000000a245e8] 0xc000000000a245e8 
(unreliable)
  [  132.643923] [c0000000d82e3c30] [c0000000005c39d8] __handle_sysrq+0xe8/0x280
  [  132.643928] [c0000000d82e3cd0] [c0000000005c4188] 
write_sysrq_trigger+0x78/0xa0
  [  132.643935] [c0000000d82e3d00] [c0000000003ad770] proc_reg_write+0xb0/0x110
  [  132.643941] [c0000000d82e3d50] [c00000000030fc3c] __vfs_write+0x6c/0xe0
  [  132.643946] [c0000000d82e3d90] [c000000000311144] vfs_write+0xd4/0x240
  [  132.643950] [c0000000d82e3de0] [c000000000312e5c] SyS_write+0x6c/0x110
  [  132.643957] [c0000000d82e3e30] [c0000000000095e0] system_call+0x38/0x108
  [  132.643961] Instruction dump:
  [  132.643963] 38425240 7c0802a6 f8010010 f821ffe1 60000000 60000000 3d220019 
3949ba60
  [  132.643972] 39200001 912a0000 7c0004ac 39400000 <992a0000> 38210020 
e8010010 7c0803a6
  [  132.643981] ---[ end trace eed6bbcd2c3bdfdf ]---
  [  132.646105]
  [  132.646176] Sending IPI to other CPUs
  [  132.647490] IPI complete
  I'm in purgatory
   -> smp_release_cpus()
  spinning_secondaries = 104
   <- smp_release_cpus()
  [    2.011346] alg: hash: Test 1 failed for crc32c-vpmsum
  [    2.729254] sd 0:2:0:0: [sda] Assuming drive cache: write through
  [    2.731554] sd 1:2:5:0: [sdn] Assuming drive cache: write through
  [    2.739087] sd 1:2:4:0: [sdm] Assuming drive cache: write through
  [    2.739089] sd 1:2:6:0: [sdo] Assuming drive cache: write through
  [    2.739110] sd 1:2:7:0: [sdp] Assuming drive cache: write through
  [    2.739115] sd 1:2:0:0: [sdi] Assuming drive cache: write through
  [    2.739122] sd 1:2:3:0: [sdl] Assuming drive cache: write through
  [    2.739123] sd 1:2:2:0: [sdk] Assuming drive cache: write through
  [    2.739148] sd 1:2:1:0: [sdj] Assuming drive cache: write through
  [    2.748938] sd 0:2:1:0: [sdb] Assuming drive cache: write through
  [    2.748939] sd 0:2:7:0: [sdh] Assuming drive cache: write through
  [    2.748940] sd 0:2:6:0: [sdg] Assuming drive cache: write through
  [    2.748942] sd 0:2:2:0: [sdc] Assuming drive cache: write through
  [    2.748958] sd 0:2:5:0: [sdf] Assuming drive cache: write through
  [    2.748963] sd 0:2:4:0: [sde] Assuming drive cache: write through
  [    2.748978] sd 0:2:3:0: [sdd] Assuming drive cache: write through
  [    2.999087] device-mapper: table: 254:0: multipath: error attaching 
hardware handler
  [    3.119912] device-mapper: table: 254:0: multipath: error attaching 
hardware handler
  [    3.252513] device-mapper: table: 254:0: multipath: error attaching 
hardware handler
  [    3.343680] device-mapper: table: 254:0: multipath: error attaching 
hardware handler
  [    3.381234] device-mapper: table: 254:1: multipath: error attaching 
hardware handler
  [    3.419515] device-mapper: table: 254:0: multipath: error attaching 
hardware handler
  [    3.474587] device-mapper: table: 254:1: multipath: error attaching 
hardware handler
  [    3.482188] device-mapper: table: 254:0: multipath: error attaching 
hardware handler
  [    3.531439] device-mapper: table: 254:1: multipath: error attaching 
hardware handler
  [    3.552824] device-mapper: table: 254:0: multipath: error attaching 
hardware handler
  [    3.594489] device-mapper: table: 254:1: multipath: error attaching 
hardware handler
  [    3.619222] device-mapper: table: 254:0: multipath: error attaching 
hardware handler
  [    3.672208] device-mapper: table: 254:0: multipath: error attaching 
hardware handler
  [    3.680298] device-mapper: table: 254:1: multipath: error attaching 
hardware handler
  [    3.731718] device-mapper: table: 254:0: multipath: error attaching 
hardware handler
  [    3.761333] device-mapper: table: 254:1: multipath: error attaching 
hardware handler
  [    3.794955] device-mapper: table: 254:0: multipath: error attaching 
hardware handler
  [    3.819212] device-mapper: table: 254:1: multipath: error attaching 
hardware handler
  [    3.871913] device-mapper: table: 254:0: multipath: error attaching 
hardware handler
  [    3.889439] device-mapper: table: 254:1: multipath: error attaching 
hardware handler
  [    3.922620] device-mapper: table: 254:0: multipath: error attaching 
hardware handler
  [    3.960707] device-mapper: table: 254:1: multipath: error attaching 
hardware handler
  [    4.002959] device-mapper: table: 254:0: multipath: error attaching 
hardware handler
  [    4.035611] device-mapper: table: 254:1: multipath: error attaching 
hardware handler
  [    4.054476] device-mapper: table: 254:0: multipath: error attaching 
hardware handler
  [    4.092241] device-mapper: table: 254:1: multipath: error attaching 
hardware handler
  [    4.099432] device-mapper: table: 254:0: multipath: error attaching 
hardware handler
  [    4.182358] device-mapper: table: 254:0: multipath: error attaching 
hardware handler
  [    4.182823] device-mapper: table: 254:1: multipath: error attaching 
hardware handler
  [    4.234767] device-mapper: table: 254:1: multipath: error attaching 
hardware handler
  [    4.333309] device-mapper: table: 254:0: multipath: error attaching 
hardware handler
  [    4.402827] device-mapper: table: 254:0: multipath: error attaching 
hardware handler

  
  Gave up waiting for root device.  Common problems:
   - Boot args (cat /proc/cmdline)
     - Check rootdelay= (did the system wait long enough?)
     - Check root= (did the system wait for the right device?)
   - Missing modules (cat /proc/modules; ls /dev)
  ALERT!  UUID=853769e5-1dc5-41be-a689-b430320d207f does not exist.  Dropping to a shell!

  
  BusyBox v1.22.1 (Ubuntu 1:1.22.0-19ubuntu2) built-in shell (ash)
  Enter 'help' for a list of built-in commands.

  (initramfs)

  
  == Comment: #7 - Vaishnavi Bhat <vaish...@in.ibm.com> - 2016-10-07 05:37:53 ==
  The blkid output does not show any device with UUID=853769e5-1dc5-41be-a689-b430320d207f,
  which is the root device used in the kexec command line (from kdump-config show):
  kexec command:
    /sbin/kexec -p --command-line="BOOT_IMAGE=/boot/vmlinux-4.8.0-15-generic root=UUID=853769e5-1dc5-41be-a689-b430320d207f ro xmon=on splash quiet irqpoll nr_cpus=1 nousb systemd.unit=kdump-tools.service" --initrd=/var/lib/kdump/initrd.img /var/lib/kdump/vmlinuz

  Hence the kdump kernel is failing to boot here.
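
One way to repeat this check, as a minimal sketch: pull the root=UUID= value out of the kexec command line and ask blkid whether any block device carries it. The helper name root_uuid is made up for illustration; blkid -U looks up a device by its UUID.

```shell
# Extract the value of root=UUID=... from a kernel command line string.
# root_uuid is a hypothetical helper, not part of kdump-tools.
root_uuid() {
    printf '%s\n' "$1" | tr ' ' '\n' | sed -n 's/^root=UUID=//p'
}

# Usage on the affected system (as root), passing the --command-line
# string reported by "kdump-config show":
#   uuid="$(root_uuid 'BOOT_IMAGE=... root=UUID=... ro ...')"
#   blkid -U "$uuid" || echo "no block device with UUID $uuid"
```

If blkid prints nothing for the UUID, the kdump kernel has no root device to wait for, which matches the "Gave up waiting for root device" message in the log.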

  == Comment: #11 - Xue Sheng Li <lixu...@cn.ibm.com> - 2016-10-17 01:54:56 ==
  Recreated with the -24 kernel.

  root@talclp1:~# echo c > /proc/sysrq-trigger
  [   72.655416] sysrq: SysRq : Trigger a crash
  [   72.655458] Unable to handle kernel paging request for data at address 
0x00000000
  [   72.655463] Faulting instruction address: 0xc00000000069d148
  [   72.655469] Oops: Kernel access of bad area, sig: 11 [#1]
  [   72.655472] SMP NR_CPUS=2048 NUMA pSeries
  [   72.655477] Modules linked in: rpadlpar_io rpaphp dccp_diag dccp tcp_diag 
udp_diag inet_diag unix_diag af_packet_diag netlink_diag rpcsec_gss_krb5 nfsv4 
nfs cifs fscache binfmt_misc xfs pseries_rng vmx_crypto nfsd auth_rpcgss 
nfs_acl lockd grace sunrpc ip_tables x_tables autofs4 btrfs xor raid6_pq 
dm_round_robin ses enclosure scsi_transport_sas bnx2x ipr mdio libcrc32c 
crc32c_vpmsum scsi_dh_emc scsi_dh_rdac scsi_dh_alua dm_multipath
  [   72.655521] CPU: 25 PID: 9730 Comm: bash Not tainted 4.8.0-24-generic 
#26-Ubuntu
  [   72.655525] task: c0000001d8451e00 task.stack: c0000001d8494000
  [   72.655529] NIP: c00000000069d148 LR: c00000000069e198 CTR: 
c00000000069d120
  [   72.655534] REGS: c0000001d84979f0 TRAP: 0300   Not tainted  
(4.8.0-24-generic)
  [   72.655537] MSR: 8000000000009033 <SF,EE,ME,IR,DR,RI,LE>  CR: 28242222  
XER: 00000001
  [   72.655549] CFAR: c000000000008750 DAR: 0000000000000000 DSISR: 42000000 
SOFTE: 1
  GPR00: c00000000069e198 c0000001d8497c70 c000000001476700 0000000000000063
  GPR04: c00000047e64aca0 c00000047e65fb40 c00000047df00000 0000000000015ed8
  GPR08: 0000000000000007 0000000000000001 0000000000000000 0000000000000001
  GPR12: c00000000069d120 c000000007b3e100 ffffffffffffffff 0000000022000000
  GPR16: 0000000010170dc8 0000010036d36398 0000000010140f58 00000000100c7570
  GPR20: 0000000000000000 000000001017dd58 0000000010153618 000000001017b608
  GPR24: 00003ffff5582464 0000000000000001 c00000000138e6a0 0000000000000004
  GPR28: c00000000138ea60 0000000000000063 c000000001342590 0000000000000000
  [   72.655608] NIP [c00000000069d148] sysrq_handle_crash+0x28/0x30
  [   72.655613] LR [c00000000069e198] __handle_sysrq+0xe8/0x280
  [   72.655616] Call Trace:
  [   72.655619] [c0000001d8497c70] [c00000000069e178] 
__handle_sysrq+0xc8/0x280 (unreliable)
  [   72.655625] [c0000001d8497d10] [c00000000069e8ec] 
write_sysrq_trigger+0x6c/0x90
  [   72.655631] [c0000001d8497d40] [c0000000003a9568] proc_reg_write+0x88/0xd0
  [   72.655637] [c0000001d8497d70] [c00000000030c40c] __vfs_write+0x3c/0x70
  [   72.655642] [c0000001d8497d90] [c00000000030d674] vfs_write+0xd4/0x240
  [   72.655647] [c0000001d8497de0] [c00000000030f1c8] SyS_write+0x68/0x110
  [   72.655652] [c0000001d8497e30] [c000000000009584] system_call+0x38/0xec
  [   72.655656] Instruction dump:
  [   72.655658] 60000000 60000000 3c4c00de 384295e0 7c0802a6 60000000 3d22001a 
3949c8e0
  [   72.655667] 39200001 912a0000 7c0004ac 39400000 <992a0000> 4e800020 
3c4c00de 384295b0
  [   72.655677] ---[ end trace 43b490f085103bf5 ]---
  [   72.659366]
  [   72.659429] Sending IPI to other CPUs
  [   72.660740] IPI complete
  I'm in purgatory
   -> smp_release_cpus()
  spinning_secondaries = 104
   <- smp_release_cpus()
  [    1.699068] ibmveth 30000002 (unnamed net_device) (uninitialized): unable 
to change IPv4 checksum offload settings. 1 rc=4
  [    1.699093] ibmveth 30000002 (unnamed net_device) (uninitialized): unable 
to change IPv6 checksum offload settings. 1 rc=4
  [    1.699101] ibmveth 30000002 (unnamed net_device) (uninitialized): unable 
to change tso settings. 1 rc=4
  [    2.657700] sd 0:2:1:0: [sdb] Assuming drive cache: write through
  [    2.657701] sd 0:2:0:0: [sda] Assuming drive cache: write through
  [    2.657781] sd 0:2:2:0: [sdc] Assuming drive cache: write through
  [    2.660641] sd 0:2:7:0: [sdh] Assuming drive cache: write through
  [    2.667731] sd 0:2:4:0: [sde] Assuming drive cache: write through
  [    2.677685] sd 0:2:6:0: [sdg] Assuming drive cache: write through
  [    2.677688] sd 0:2:5:0: [sdf] Assuming drive cache: write through
  [    2.677708] sd 0:2:3:0: [sdd] Assuming drive cache: write through
  [    2.697737] sd 1:2:6:0: [sdo] Assuming drive cache: write through
  [    2.697743] sd 1:2:1:0: [sdj] Assuming drive cache: write through
  [    2.697744] sd 1:2:4:0: [sdm] Assuming drive cache: write through
  [    2.697747] sd 1:2:2:0: [sdk] Assuming drive cache: write through
  [    2.697749] sd 1:2:3:0: [sdl] Assuming drive cache: write through
  [    2.697753] sd 1:2:5:0: [sdn] Assuming drive cache: write through
  [    2.699340] sd 1:2:7:0: [sdp] Assuming drive cache: write through
  [    2.699360] sd 1:2:0:0: [sdi] Assuming drive cache: write through
  [    3.350794] device-mapper: table: 252:0: multipath: error attaching 
hardware handler
  [    3.471468] device-mapper: table: 252:0: multipath: error attaching 
hardware handler
  [    3.540387] device-mapper: table: 252:0: multipath: error attaching 
hardware handler
  [    3.628523] device-mapper: table: 252:0: multipath: error attaching 
hardware handler
  [    3.657731] device-mapper: table: 252:1: multipath: error attaching 
hardware handler
  [    3.733416] device-mapper: table: 252:0: multipath: error attaching 
hardware handler
  [    3.752066] device-mapper: table: 252:1: multipath: error attaching 
hardware handler
  [    3.808884] device-mapper: table: 252:0: multipath: error attaching 
hardware handler
  [    3.838148] device-mapper: table: 252:1: multipath: error attaching 
hardware handler
  [    3.919247] device-mapper: table: 252:0: multipath: error attaching 
hardware handler
  [    3.950262] device-mapper: table: 252:1: multipath: error attaching 
hardware handler
  [    3.997839] device-mapper: table: 252:0: multipath: error attaching 
hardware handler
  [    4.007810] device-mapper: table: 252:1: multipath: error attaching 
hardware handler
  [    4.082174] device-mapper: table: 252:0: multipath: error attaching 
hardware handler
  [    4.089411] device-mapper: table: 252:1: multipath: error attaching 
hardware handler
  [    4.162200] device-mapper: table: 252:0: multipath: error attaching 
hardware handler
  [    4.202441] device-mapper: table: 252:1: multipath: error attaching 
hardware handler
  [    4.252289] device-mapper: table: 252:0: multipath: error attaching 
hardware handler
  [    4.279870] device-mapper: table: 252:1: multipath: error attaching 
hardware handler
  [    4.311712] device-mapper: table: 252:0: multipath: error attaching 
hardware handler
  [    4.348150] device-mapper: table: 252:1: multipath: error attaching 
hardware handler
  [    4.402076] device-mapper: table: 252:0: multipath: error attaching 
hardware handler
  [    4.432069] device-mapper: table: 252:1: multipath: error attaching 
hardware handler
  [    4.487871] device-mapper: table: 252:0: multipath: error attaching 
hardware handler
  [    4.518282] device-mapper: table: 252:1: multipath: error attaching 
hardware handler
  [    4.573338] device-mapper: table: 252:0: multipath: error attaching 
hardware handler
  [    4.599280] device-mapper: table: 252:1: multipath: error attaching 
hardware handler
  [    4.632144] device-mapper: table: 252:0: multipath: error attaching 
hardware handler
  [    4.671142] device-mapper: table: 252:1: multipath: error attaching 
hardware handler
  [    4.713352] device-mapper: table: 252:0: multipath: error attaching 
hardware handler
  [    4.782117] device-mapper: table: 252:0: multipath: error attaching 
hardware handler
  [    4.890336] device-mapper: table: 252:0: multipath: error attaching 
hardware handler

  == Comment: #13 - Hari Krishna Bathini <hbath...@in.ibm.com> - 2016-10-19 16:26:57 ==
  (In reply to comment #12)
  > Hi Hari,
  > 
  > Can you please take a look at this issue and suggest what the next
  > step would be?
  > We are facing this issue with the -24 kernel as well. Could this be an issue with
  > the kdump kernel missing multipath modules, or some other issue?
  > 

  Hi Vaishnavi,

  The necessary hardware handler modules are missing from the kdump initrd.
  The console log of the kdump kernel says the same:

  --
  Begin: Loading multipath hardware handlers ... Failure: failed to load module scsi_dh_alua.
  Failure: failed to load module scsi_dh_rdac.
  Failure: failed to load module scsi_dh_emc.
  --

  After including these modules explicitly and rebuilding the kdump initrd, I was able to
  get to the point where makedumpfile starts to capture the dump, but it fails with:

      "get_mem_map: Can't distinguish the memory type."

  This failure is already tracked in bug 146571.

  Thanks
  Hari

  PS1: To explicitly add modules to the kdump initrd:

        1. List the necessary modules in the /var/lib/kdump/initramfs-tools/modules file
        2. mkinitramfs -d /var/lib/kdump/initramfs-tools -o /var/lib/kdump/initrd.img-$kver
        3. systemctl restart kdump-tools.service
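
The steps above can be sketched as a small script. This is only an illustration under stated assumptions: the function names add_modules and rebuild_kdump_initrd are hypothetical, $kver is assumed to be the running kernel version from uname -r, and the module names are the three reported as failing in the kdump console log.

```shell
#!/bin/sh
# Sketch of the workaround; run as root on the affected system.
# The path is the one named in step 1; override MODULES_FILE to try it elsewhere.
MODULES_FILE="${MODULES_FILE:-/var/lib/kdump/initramfs-tools/modules}"

# Append each module name to the list unless it is already present.
# (hypothetical helper, not part of kdump-tools)
add_modules() {
    for mod in "$@"; do
        grep -qxF "$mod" "$MODULES_FILE" 2>/dev/null || echo "$mod" >> "$MODULES_FILE"
    done
}

# Rebuild the kdump initrd and reload the kdump kernel (steps 2 and 3).
# Assumption: $kver is the running kernel version.
rebuild_kdump_initrd() {
    kver="$(uname -r)"
    mkinitramfs -d /var/lib/kdump/initramfs-tools -o "/var/lib/kdump/initrd.img-$kver"
    systemctl restart kdump-tools.service
}

# Usage (as root):
#   add_modules scsi_dh_alua scsi_dh_rdac scsi_dh_emc
#   rebuild_kdump_initrd
```

Checking for an existing entry before appending keeps the modules file free of duplicates if the script is run more than once.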

  
  Mirroring this bug to Canonical for their input on whether to include the missing
  hardware handler modules in the kdump initrd or to proceed with the workaround.

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-power-systems/+bug/1635597/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp
