Reposting internal comment to make it visible to Canonical and hopefully
with Thomas' help determine if this should be marked as verification-
failed-xenial and additional patches will be necessary.

> Att: Linux team,
>
> I installed the latest ubuntu package. Here is the details.
>
> hmc  alphmc4
> cec : rdx2fp1
> lpar : rdx2p03
>
> # uname -a
> Linux rdx2p03 4.4.0-44-generic #64-Ubuntu SMP Mon Oct 17 08:43:50 UTC 2016
> ppc64le ppc64le ppc64le GNU/Linux
>
> (0) root @ rdx2p03: /root
> # apt-get update
> Hit:1 http://us.ports.ubuntu.com/ubuntu-ports xenial InRelease
> Hit:2 http://us.ports.ubuntu.com/ubuntu-ports xenial-updates InRelease
> Hit:3 http://ports.ubuntu.com/ubuntu-ports xenial-security InRelease
> Hit:4 http://us.ports.ubuntu.com/ubuntu-ports xenial-backports InRelease
> Hit:5 http://ports.ubuntu.com/ubuntu-ports xenial-proposed InRelease
> Reading package lists... Done
>
> (0) root @ rdx2p03: /root
> # apt-get upgrade
> Reading package lists... Done
> Building dependency tree
> Reading state information... Done
> Calculating upgrade... Done
> 0 upgraded, 0 newly installed, 0 to remove and 0 not upgraded.
>
> (0) root @ rdx2p03: /root
> # apt-get dist-upgrade
> Reading package lists... Done
> Building dependency tree
> Reading state information... Done
> Calculating upgrade... Done
> 0 upgraded, 0 newly installed, 0 to remove and 0 not upgraded.
>
> (0) root @ rdx2p03: /root
>
> I was able to create vNIC and backing device.
>
> No response for lspci.
>
> # ip link
> 1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN mode
> DEFAULT group default qlen 1
>     link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
> 2: ibmveth2: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast
> state UNKNOWN mode DEFAULT group default qlen 1000
>     link/ether 06:73:67:dd:71:02 brd ff:ff:ff:ff:ff:ff
> 3: eth0: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN mode DEFAULT
> group default qlen 1000
>     link/ether 06:73:67:dd:71:07 brd ff:ff:ff:ff:ff:ff
> 4: eth1: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN mode DEFAULT
> group default qlen 1000
>     link/ether 06:73:67:dd:71:08 brd ff:ff:ff:ff:ff:ff
>
> But build_net help failed with oops msg
>
> Automation script will now try to detect your network topology.
> In case of errors manually edit bpt file and run build_net bpt to setup
> hxecom networks
> [  140.400376] Unable to handle kernel paging request for data at address
> 0x00000000
> [  140.400386] Faulting instruction address: 0xd0000000017b6534
> [  140.400391] Oops: Kernel access of bad area, sig: 11 [#1]
> [  140.400393] SMP NR_CPUS=2048 NUMA pSeries
> [  140.400398] ibmvnic 30000007: Error 7 in QUERY_MAP_RSP
> [  140.400403] Modules linked in: rpadlpar_io rpaphp dccp_diag dccp tcp_diag
> udp_diag inet_diag unix_diag af_packet_diag netlink_diag binfmt_misc
> pseries_rng rtc_generic dm_round_robin sunrpc dm_multipath autofs4 ibmvnic
> ibmveth ibmvscsi
> [  140.400421] CPU: 5 PID: 13808 Comm: ip Tainted: G        W
> 4.4.0-44-generic #64-Ubuntu
> [  140.400424] task: c00000014b4d26e0 ti: c00000014b5b4000 task.ti:
> c00000014b5b4000
> [  140.400427] NIP: d0000000017b6534 LR: d0000000017b65f0 CTR:
> 00000000004526e0
> [  140.400430] REGS: c00000014b5b70c0 TRAP: 0300   Tainted: G        W
> (4.4.0-44-generic)
> [  140.400433] MSR: 8000000000009033 <SF,EE,ME,IR,DR,RI,LE>  CR: 24008284
> XER: 00000001
> [  140.400442] CFAR: c000000000008468 DAR: 0000000000000000 DSISR: 40000000
> SOFTE: 1
> GPR00: d0000000017b65f0 c00000014b5b7340 d0000000017c4250 c0000001fbe62900
> GPR04: 0000000000010360 0000000000000000 0000000000000360 0000000000000000
> GPR08: 0000000000000000 0000000000000000 0000000000000000 00000000004526e0
> GPR12: 0000000000008800 c00000000fb42f80 00003fffd49ce548 0000000000000000
> GPR16: c0000001fbe62000 0000000000000000 c0000000fcf79300 c0000001f89c3448
> GPR20: 0000000000000000 c0000001f89c3400 0000000000000010 0000000000000000
> GPR24: d0000000017b6670 0000000000000000 0000000000000000 0000000000000000
> GPR28: c0000001fbe62900 0000000000000004 0000000000000001 0000000000000000
> [  140.400481] NIP [d0000000017b6534] ibmvnic_open+0x894/0x9d0 [ibmvnic]
> [  140.400485] LR [d0000000017b65f0] ibmvnic_open+0x950/0x9d0 [ibmvnic]
> [  140.400487] Call Trace:
> [  140.400490] [c00000014b5b7340] [d0000000017b65f0]
> ibmvnic_open+0x950/0x9d0 [ibmvnic] (unreliable)
> [  140.400497] [c00000014b5b7440] [c00000000098cc20] __dev_open+0x160/0x220
> [  140.400501] [c00000014b5b74d0] [c00000000098d0b8]
> __dev_change_flags+0xd8/0x1e0
> [  140.400504] [c00000014b5b7550] [c00000000098d208]
> dev_change_flags+0x48/0xa0
> [  140.400508] [c00000014b5b75a0] [c0000000009a28a8] do_setlink+0x388/0xd00
> [  140.400511] [c00000014b5b7700] [c0000000009a3920] rtnl_newlink+0x580/0x9b0
> [  140.400514] [c00000014b5b79c0] [c0000000009a1c80]
> rtnetlink_rcv_msg+0x140/0x2c0
> [  140.400518] [c00000014b5b7a40] [c0000000009d4288]
> netlink_rcv_skb+0x118/0x150
> [  140.400522] [c00000014b5b7a80] [c0000000009a1b14] rtnetlink_rcv+0x44/0x70
> [  140.400525] [c00000014b5b7ab0] [c0000000009d3678]
> netlink_unicast+0x198/0x240
> [  140.400529] [c00000014b5b7b10] [c0000000009d3e2c]
> netlink_sendmsg+0x58c/0x6f0
> [  140.400532] [c00000014b5b7bb0] [c00000000095e8dc] sock_sendmsg+0x6c/0xa0
> [  140.400535] [c00000014b5b7be0] [c00000000095f69c]
> ___sys_sendmsg+0x2fc/0x310
> [  140.400539] [c00000014b5b7d80] [c000000000960e10] __sys_sendmsg+0x60/0xc0
> [  140.400543] [c00000014b5b7e30] [c000000000009204] system_call+0x38/0xb4
> [  140.400545] Instruction dump:
> [  140.400547] 89300ad0 2f890000 419e00dc e9500e58 39200000 3bc00000
> 2faa0000 419e0030
> [  140.400553] e9500d30 79291f24 7f83e378 3bde0001 <7c8a482a> 7fde07b4
> 4bffa03d e9300e58
> [  140.400560] ---[ end trace cedfa219b7b7feb2 ]---
> [  140.402250]

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1626222

Title:
  STC860:alpine-pdq:alpdq5p03: kernel panic when adding vnic

Status in linux package in Ubuntu:
  Fix Released
Status in linux source package in Xenial:
  Fix Committed
Status in linux source package in Yakkety:
  Fix Released

Bug description:
  Description:    Ubuntu 16.04 LTS
  Linux alpdq5p03 4.4.0-22-generic #40-Ubuntu SMP Thu May 12 22:03:35 UTC 2016 
ppc64le ppc64le ppc64le GNU/Linux

  Problem Description :

  When I created a vNIC on alpdq5p03 using U78C7.001.KIC1906-P1-C12-T1
  via alpdq5p08 and U78C7.001.KIC1906-P1-C12-T2 via alpdq5p09 as the
  backing devices I got a kernel panic.

  [  129.977324] ibmvnic 30000004: Virtual Adapter failed (rc=8)
  [  129.977346] Unable to handle kernel paging request for data at address 
0x000000f1
  [  129.977351] Faulting instruction address: 0xd000000010882434
  [  129.977355] Oops: Kernel access of bad area, sig: 11 [#1]
  [  129.977359] SMP NR_CPUS=2048 NUMA pSeries
  [  129.977364] Modules linked in: ibmvnic rpadlpar_io rpaphp nfsv3 nfs_acl 
rpcsec_gss_krb5 auth_rpcgss nfsv4 nfs lockd grace fscache dccp_diag dccp 
tcp_diag udp_diag inet_diag unix_diag af_packet_diag netlink_diag rtc_generic 
pseries_rng binfmt_misc sunrpc autofs4 ibmvscsi ibmveth
  [  129.977392] CPU: 0 PID: 0 Comm: swapper/0 Not tainted 4.4.0-22-generic 
#40-Ubuntu
  [  129.977397] task: c0000000015566b0 ti: c000001ffffd8000 task.ti: 
c0000000015b0000
  [  129.977401] NIP: d000000010882434 LR: d000000010882420 CTR: 
c0000000002b25f0
  [  129.977406] REGS: c000001ffffdb9c0 TRAP: 0300   Not tainted  
(4.4.0-22-generic)
  [  129.977410] MSR: 8000000000009033 <SF,EE,ME,IR,DR,RI,LE>  CR: 48048224  
XER: 20000010
  [  129.977422] CFAR: c000000000008468 DAR: 00000000000000f1 DSISR: 40000000 
SOFTE: 0 
  GPR00: d000000010882420 c000001ffffdbc40 d000000010894240 0000000000000000 
  GPR04: c000001ff4039ee0 0000000000000000 0000001ffdd80000 0000000000002082 
  GPR08: c000000000f9fda8 c000000000f9fda8 0000001ffdd80000 d0000000108896e8 
  GPR12: c0000000002b25f0 c00000000e7e0000 c0000000015eef18 c000000000f9d678 
  GPR16: 0000000000000001 0000000000000000 c000001fd3960e18 c000001fd3960db0 
  GPR20: c0000000015b0000 c000001fd3960d98 c000001fd3960e08 c000001fd3960900 
  GPR24: c000001fd3960d88 c000001fd39bf000 c000001fd39bf048 0000000000000000 
  GPR28: 0000000000000100 0000000000000200 00000000000000f0 c000001fd3960900 
  [  129.977482] NIP [d000000010882434] ibmvnic_free_inflight+0x104/0x330 
[ibmvnic]
  [  129.977488] LR [d000000010882420] ibmvnic_free_inflight+0xf0/0x330 
[ibmvnic]
  [  129.977492] Call Trace:
  [  129.977496] [c000001ffffdbc40] [d000000010882420] 
ibmvnic_free_inflight+0xf0/0x330 [ibmvnic] (unreliable)
  [  129.977503] [c000001ffffdbce0] [d000000010887188] 
ibmvnic_handle_crq+0x3a8/0x2010 [ibmvnic]
  [  129.977509] [c000001ffffdbda0] [d000000010888ea8] 
ibmvnic_interrupt+0xb8/0x198 [ibmvnic]
  [  129.977517] [c000001ffffdbe10] [c000000000130660] 
handle_irq_event_percpu+0xa0/0x320
  [  129.977523] [c000001ffffdbed0] [c000000000130948] 
handle_irq_event+0x68/0xc0
  [  129.977528] [c000001ffffdbf00] [c000000000135c2c] 
handle_fasteoi_irq+0xec/0x2b0
  [  129.977534] [c000001ffffdbf30] [c00000000012f844] 
generic_handle_irq+0x54/0x80
  [  129.977539] [c000001ffffdbf60] [c000000000011320] __do_irq+0x80/0x1d0
  [  129.977545] [c000001ffffdbf90] [c000000000024800] call_do_irq+0x14/0x24
  [  129.977550] [c0000000015b39a0] [c000000000011508] do_IRQ+0x98/0x140
  [  129.977555] [c0000000015b39f0] [c000000000002594] 
hardware_interrupt_common+0x114/0x180
  [  129.977563] --- interrupt: 501 at plpar_hcall_norets+0x1c/0x28
  [  129.977563]     LR = check_and_cede_processor+0x34/0x50
  [  129.977570] [c0000000015b3ce0] [c00000000090d9f0] 
check_and_cede_processor+0x20/0x50 (unreliable)
  [  129.977576] [c0000000015b3d40] [c00000000090da94] 
dedicated_cede_loop+0x74/0x190
  [  129.977582] [c0000000015b3d80] [c00000000090ac20] 
cpuidle_enter_state+0x160/0x410
  [  129.977588] [c0000000015b3de0] [c000000000119a88] call_cpuidle+0x78/0xd0
  [  129.977593] [c0000000015b3e20] [c000000000119e5c] 
cpu_startup_entry+0x37c/0x480
  [  129.977599] [c0000000015b3ee0] [c00000000000bdcc] rest_init+0xac/0xc0
  [  129.977605] [c0000000015b3f00] [c000000000ea3f5c] start_kernel+0x53c/0x558
  [  129.977610] [c0000000015b3f90] [c000000000008c6c] 
start_here_common+0x20/0xa8
  [  129.977615] Instruction dump:
  [  129.977617] e95e0010 7fc3f378 f92a0008 f9490000 fb9e0010 fbbe0018 480072cd 
e8410018 
  [  129.977626] ebde0010 7fb6f040 3bdefff0 419e00f4 <893e0001> 2f890009 
409effb4 7ea3ab78 
  [  129.977636] ---[ end trace 79d415687d3074f2 ]---
  [  129.979495] 
  [  131.979529] Kernel panic - not syncing: Fatal exception in interrupt
  [  131.985480] ---[ end Kernel panic - not syncing: Fatal exception in 
interrupt

  There was a fix for this particular bug in the ibmvnic_free_inflight
  function pushed upstream.

  Here is the commit (actually noticed by someone else independently
  before I could push my own patch)

  
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/commit/?id=96183182ad05d1ce31b9048921c12bf4ad621eaf

  This patch should fix this kernel panic.

  A test kernel needs to be built and tested but mirroring to Launchpad
  for a heads up to Canonical.

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1626222/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to