>
> Hi,
>
> Seeing failures when trying to do PCI passthrough of Intel XL710 40G
interface to KVM vm.
> 0a:00.1 Ethernet controller: Intel Corporation Ethernet Controller
XL710 for 40GbE QSFP+ (rev 01)
>
> From dmesg on host:
>
> [80326.559674] kvm: zapping shadow pages for mmio generation wraparound
> [80327.271191] kvm [175994]: vcpu0 unhandled rdmsr: 0x1c9
> [80327.271689] kvm [175994]: vcpu0 unhandled rdmsr: 0x1a6
> [80327.272201] kvm [175994]: vcpu0 unhandled rdmsr: 0x1a7
> [80327.272681] kvm [175994]: vcpu0 unhandled rdmsr: 0x3f6
> [80327.376186] kvm [175994]: vcpu0 unhandled rdmsr: 0x606
>
> The pci device is still available in the VM but stat transfer fails.
>
> With the i40e driver, the data transfer fails.
> Relevant dmesg output:
> [ 11.544088] i40e 0000:00:05.0 eth1: NIC Link is Up 40 Gbps Full
Duplex, Flow Control: None
> [ 11.689178] i40e 0000:00:06.0 eth2: NIC Link is Up 40 Gbps Full
Duplex, Flow Control: None
> [ 16.704071] ------------[ cut here ]------------
> [ 16.705053] WARNING: CPU: 1 PID: 0 at net/sched/sch_generic.c:303
dev_watchdog+0x23e/0x250()
> [ 16.705053] NETDEV WATCHDOG: eth1 (i40e): transmit queue 1 timed out
> [ 16.705053] Modules linked in: cirrus ttm drm_kms_helper i40e drm
ppdev serio_raw i2c_piix4 virtio_net parport_pc ptp virtio_balloon
crct10dif_pclmul pps_core parport pvpanic crc32_pclmul ghash_clmulni_intel
virtio_blk crc32c_intel virtio_pci virtio_ring virtio ata_generic pata_acpi
> [ 16.705053] CPU: 1 PID: 0 Comm: swapper/1 Not tainted
3.18.7-200.fc21.x86_64 #1
> [ 16.705053] Hardware name: Fedora Project OpenStack Nova, BIOS
1.7.5-20140709_153950- 04/01/2014
> [ 16.705053] 0000000000000000 2e5932b294d0c473 ffff88043fc83d48
ffffffff8175e686
> [ 16.705053] 0000000000000000 ffff88043fc83da0 ffff88043fc83d88
ffffffff810991d1
> [ 16.705053] ffff88042958f5c0 0000000000000001 ffff88042865f000
0000000000000001
> [ 16.705053] Call Trace:
> [ 16.705053] <IRQ> [<ffffffff8175e686>] dump_stack+0x46/0x58
> [ 16.705053] [<ffffffff810991d1>] warn_slowpath_common+0x81/0xa0
> [ 16.705053] [<ffffffff81099245>] warn_slowpath_fmt+0x55/0x70
> [ 16.705053] [<ffffffff8166e62e>] dev_watchdog+0x23e/0x250
> [ 16.705053] [<ffffffff8166e3f0>] ? dev_graft_qdisc+0x80/0x80
> [ 16.705053] [<ffffffff810fd52a>] call_timer_fn+0x3a/0x120
> [ 16.705053] [<ffffffff8166e3f0>] ? dev_graft_qdisc+0x80/0x80
> [ 16.705053] [<ffffffff810ff692>] run_timer_softirq+0x212/0x2f0
> [ 16.705053] [<ffffffff8109d7a4>] __do_softirq+0x124/0x2d0
> [ 16.705053] [<ffffffff8109db75>] irq_exit+0x125/0x130
> [ 16.705053] [<ffffffff817681d8>] smp_apic_timer_interrupt+0x48/0x60
> [ 16.705053] [<ffffffff817662bd>] apic_timer_interrupt+0x6d/0x80
> [ 16.705053] <EOI> [<ffffffff811005c8>] ? hrtimer_start+0x18/0x20
> [ 16.705053] [<ffffffff8105ca96>] ? native_safe_halt+0x6/0x10
> [ 16.705053] [<ffffffff810f81d3>] ? rcu_eqs_enter+0xa3/0xb0
> [ 16.705053] [<ffffffff8101ec7f>] default_idle+0x1f/0xc0
> [ 16.705053] [<ffffffff8101f64f>] arch_cpu_idle+0xf/0x20
> [ 16.705053] [<ffffffff810dad35>] cpu_startup_entry+0x3c5/0x410
> [ 16.705053] [<ffffffff8104a2af>] start_secondary+0x1af/0x1f0
> [ 16.705053] ---[ end trace 7bda53aeda558267 ]---
> [ 16.705053] i40e 0000:00:05.0 eth1: tx_timeout recovery level 1
> [ 16.705053] i40e 0000:00:05.0: i40e_vsi_control_tx: VSI seid 519 Tx
ring 0 disable timeout
> [ 16.744198] i40e 0000:00:05.0: i40e_vsi_control_tx: VSI seid 520 Tx
ring 64 disable timeout
> [ 16.779322] i40e 0000:00:05.0: i40e_ptp_init: added PHC on eth1
> [ 16.791819] i40e 0000:00:05.0: PF 40 attempted to control timestamp
mode on port 1, which is owned by PF 1
> [ 16.933869] i40e 0000:00:05.0 eth1: NIC Link is Up 40 Gbps Full
Duplex, Flow Control: None
> [ 18.853624] SELinux: initialized (dev tmpfs, type tmpfs), uses
transition SIDs
> [ 22.720083] i40e 0000:00:05.0 eth1: tx_timeout recovery level 2
> [ 22.826993] i40e 0000:00:05.0: i40e_vsi_control_tx: VSI seid 519 Tx
ring 0 disable timeout
> [ 22.935288] i40e 0000:00:05.0: i40e_vsi_control_tx: VSI seid 520 Tx
ring 64 disable timeout
> [ 23.669555] i40e 0000:00:05.0: i40e_ptp_init: added PHC on eth1
> [ 23.682067] i40e 0000:00:05.0: PF 40 attempted to control timestamp
mode on port 1, which is owned by PF 1
> [ 23.722423] i40e 0000:00:05.0 eth1: NIC Link is Up 40 Gbps Full
Duplex, Flow Control: None
> [ 23.800206] i40e 0000:00:06.0: i40e_ptp_init: added PHC on eth2
> [ 23.813804] i40e 0000:00:06.0: PF 48 attempted to control timestamp
mode on port 0, which is owned by PF 0
> [ 23.855275] i40e 0000:00:06.0 eth2: NIC Link is Up 40 Gbps Full
Duplex, Flow Control: None
> [ 38.720091] i40e 0000:00:05.0 eth1: tx_timeout recovery level 3
> [ 38.725844] random: nonblocking pool is initialized
> [ 38.729874] i40e 0000:00:06.0: HMC error interrupt
> [ 38.733425] i40e 0000:00:06.0: i40e_vsi_control_tx: VSI seid 518 Tx
ring 0 disable timeout
> [ 38.738886] i40e 0000:00:06.0: i40e_vsi_control_tx: VSI seid 521 Tx
ring 64 disable timeout
> [ 39.689569] i40e 0000:00:06.0: i40e_ptp_init: added PHC on eth2
> [ 39.704197] i40e 0000:00:06.0: PF 48 attempted to control timestamp
mode on port 0, which is owned by PF 0
> [ 39.746879] i40e 0000:00:06.0 eth2: NIC Link is Down
> [ 39.838356] i40e 0000:00:05.0: i40e_ptp_init: added PHC on eth1
> [ 39.851788] i40e 0000:00:05.0: PF 40 attempted to control timestamp
mode on port 1, which is owned by PF 1
> [ 39.892822] i40e 0000:00:05.0 eth1: NIC Link is Down
> [ 43.011610] i40e 0000:00:06.0 eth2: NIC Link is Up 40 Gbps Full
Duplex, Flow Control: None
> [ 43.059976] i40e 0000:00:05.0 eth1: NIC Link is Up 40 Gbps Full
Duplex, Flow Control: None
>
>
> Would appreciate any information on how to debug this issue further and
if the "unhandled rdmsr" logs from KVM indicate some issues with the device
passthrough.
>
> Thanks
> Jacob