On Fri, 06 Jul 2018 14:03:00 +0200, Jan Beulich <jbeul...@suse.com> wrote:
>> I am currently researching a transmit queue timeout with Xen 4.8.2 and
>> Intel X722 (i40e driver). The problem occurs with various linux versions
>> (4.8.17, 4.13.16, SLES 15 port of i40e). The problem seems to be related
>> to heavy forwarding/bridging as I am running a heavy network stress test
>> in a domU (linux/pvm 4.13.16). It seems that if I run the same test
>> without Xen, it works (not sure).
>
> The log fragment below of course tells about nothing on why this
> is happening. Couple of questions therefore:

Thanks for suggesting helpful further steps.

> - Are interrupts still arriving for this device at the point of the
>   reported timeout?
> - Are interrupts distributed reasonably evenly between (v)CPUs?
> - Is the overall interrupt rate not higher than what the system
>   can reasonably handle (the lower handling overhead means
>   without Xen a higher rate would still be acceptable)?
> - Is the same heavy forwarding/bridging in effect when trying this
>   without Xen?
> - Does running the same stress test in Dom0 work?
>
> I take it that there are no other relevant messages in any of the
> logs, or else you would have provided them right away.
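
For anyone wanting to answer the interrupt questions above with numbers, here is a minimal sketch in Python. It assumes the usual Linux /proc/interrupts layout (a header row of CPU names, then one row per IRQ with one counter per CPU) and that the device's rows contain the string "i40e"; under Xen the rows may be labelled differently, so the match string is only a guess. The function names are made up for illustration.

```python
import time


def per_cpu_counts(text, needle="i40e"):
    """Sum the per-CPU interrupt counters of all rows containing `needle`.

    `text` is the content of /proc/interrupts. The first line is assumed
    to list the CPUs; every following row starts with an IRQ label and
    then has one counter per CPU.
    """
    lines = text.splitlines()
    ncpu = len(lines[0].split())
    totals = [0] * ncpu
    for line in lines[1:]:
        if needle not in line:
            continue
        fields = line.split()[1:1 + ncpu]
        # Skip short or non-numeric rows instead of guessing.
        if len(fields) == ncpu and all(f.isdigit() for f in fields):
            for cpu, value in enumerate(fields):
                totals[cpu] += int(value)
    return totals


def per_cpu_rates(interval=5.0, path="/proc/interrupts", needle="i40e"):
    """Return interrupts per second per CPU, measured over `interval` seconds."""
    with open(path) as f:
        before = per_cpu_counts(f.read(), needle)
    time.sleep(interval)
    with open(path) as f:
        after = per_cpu_counts(f.read(), needle)
    return [(b - a) / interval for a, b in zip(before, after)]
```

Calling per_cpu_rates() while the stress test runs, and again at the moment the timeout is reported, would show whether interrupts are still arriving, how they are spread over the (v)CPUs, and roughly what the overall rate is.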

Actually, it seems that the driver is the problem, which is quite counterintuitive because the driver is quite old (started in 2013) and you would expect it to be very mature.

For a test I used the 2.4.10 version from https://sourceforge.net/projects/e1000/files/i40e%20stable/ and all the problems went away. I am writing this here so that others with the same problem have a possible solution to try.
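
To check which driver version is actually loaded before and after such a swap, something like the following Python sketch might help. It assumes the ethtool utility is installed and parses the "key: value" lines that `ethtool -i` prints; the interface name "eth0" in the usage note is a placeholder, and the function names are made up for illustration.

```python
import subprocess


def parse_ethtool_info(text):
    """Turn the 'key: value' lines printed by `ethtool -i` into a dict."""
    info = {}
    for line in text.splitlines():
        # Split on the first colon only, so values such as PCI addresses
        # (which contain colons themselves) stay intact.
        key, sep, value = line.partition(":")
        if sep:
            info[key.strip()] = value.strip()
    return info


def driver_info(iface):
    """Run `ethtool -i <iface>` and return the parsed fields."""
    result = subprocess.run(
        ["ethtool", "-i", iface],
        capture_output=True, text=True, check=True,
    )
    return parse_ethtool_info(result.stdout)
```

For example, driver_info("eth0")["version"] should report the version of the module bound to that interface, which makes it easy to confirm that the out-of-tree build is in use rather than the one shipped with the kernel.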

Regards Andreas

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xenproject.org
https://lists.xenproject.org/mailman/listinfo/xen-devel
