Hm, OK. I did a search for the HW CRC message earlier and it sounded like the message was just for information and had no real functional impact, but maybe it does.. (http://dpdk.org/dev/patchwork/patch/12080/)
Would there be a workaround outside VPP for 2. somehow? /Tomas On 12 May 2017 at 16:30, Damjan Marion (damarion) <damar...@cisco.com> wrote: > > There are 2 problems: > > 1. HW CRC strip needs to be enabled for VFs, that’s why DPDK is failing to > init device > 2. VFs are dropping packets when .max_rx_pkt_len is set to 9216 > > Problem 1. is easy fixable by changing .hw_strip_crc to 1 in > src/plugins/dpdk/device/init.c > > Problem 2. seems to be outside of VPP control, but it can be workarounded > by setting .max_rx_pkt_len to 1518. consequence of doing this is that we > will (likely) loose jumbo frame support on VFs. > > I’m going to submit patch which fixes both issues soon (actually > workarounds 2. ), I need to play a bit more with 2. first... > > > > On 12 May 2017, at 12:08, Tomas Brännström <tomas.a.brannst...@tieto.com> > wrote: > > Unfortunately my MTU seems to be at 1500 already. > > I did an upgrade to release 1704 and now none of the interfaces are > discovered anymore. But it seems suspicious since there are basically no > log printouts at startup either, below are 1701 vs 1704 for comparision. > This isn't exclusive for this "SR-IOV" machine either I think, when running > later VPP in for example virtual box I get the same problems, so I guess > there's something additional that must be done that's maybe not documented > on the wiki yet. > > 17.01: > ------ > vlib_plugin_early_init:213: plugin path /usr/lib/vpp_plugins > vpp[5066]: vlib_pci_bind_to_uio: Skipping PCI device 0000:00:03.0 as host > interface eth0 is up > EAL: Detected 4 lcore(s) > EAL: No free hugepages reported in hugepages-1048576kB > EAL: Probing VFIO support... > EAL: WARNING: cpu flags constant_tsc=yes nonstop_tsc=no -> using > unreliable clock cycles ! > EAL: PCI device 0000:00:03.0 on NUMA socket -1 > EAL: Device is blacklisted, not initializing > EAL: PCI device 0000:00:06.0 on NUMA socket -1 > EAL: probe driver: 8086:10ed net_ixgbe_vf > EAL: PCI device 0000:00:07.0 on NUMA socket -1 > EAL: probe driver: 8086:10ed net_ixgbe_vf > DPDK physical memory layout: > Segment 0: phys:0x5cc00000, len:2097152, virt:0x7f7e0b800000, socket_id:0, > hugepage_sz:2097152, nchannel:0, nrank:0 > Segment 1: phys:0x5d000000, len:266338304, virt:0x7f7db1600000, > socket_id:0, hugepage_sz:2097152, nchannel:0, nrank:0 > PMD: ixgbevf_dev_configure(): VF can't disable HW CRC Strip > PMD: ixgbevf_dev_configure(): VF can't disable HW CRC Strip > > 17.04: > ------ > vlib_plugin_early_init:360: plugin path /usr/lib/vpp_plugins > > /Tomas > > On 12 May 2017 at 11:49, Gonsalves, Avinash (Nokia - IN/Bangalore) < > avinash.gonsal...@nokia.com> wrote: > >> I faced a similar issue with SR-IOV, and for some reason setting the MTU >> size to 1500 on the interface helped with ARP resolution. >> >> >> >> Thanks, >> Avinash >> >> >> >> ================================================================================ >> >> >> >> Thanks. I can try to use a later VPP version. A thing to note is that when >> >> we did try to use the master release before, VPP failed to discover >> >> interfaces, even when they were whitelisted. Not sure if something has >> >> changed in the way VPP discover interfaces in later versions. I will try >> >> with 1704 though. >> >> >> >> Ah OK sorry, should have realized what PF was in this context. Not sure of >> >> all the info that might be needed but here's what I could think of: >> >> >> >> root at node-4 <https://lists.fd.io/mailman/listinfo/vpp-dev>:~# lspci -t -v >> >> [...] >> >> +-03.0-[0b-0c]--+-00.0 Intel Corporation 82599ES 10-Gigabit >> >> SFI/SFP+ Network Connection >> >> | +-00.1 Intel Corporation 82599ES 10-Gigabit >> >> SFI/SFP+ Network Connection >> >> | +-10.0 Intel Corporation 82599 Ethernet >> >> Controller Virtual Function >> >> | +-10.1 Intel Corporation 82599 Ethernet >> >> Controller Virtual Function >> >> [...] >> >> >> >> root at node-4 <https://lists.fd.io/mailman/listinfo/vpp-dev>:~# lshw -class >> network -businfo >> >> Bus info Device Class Description >> >> ============================================================ >> >> pci at 0000 <https://lists.fd.io/mailman/listinfo/vpp-dev>:0b:00.0 >> enp11s0f0 network 82599ES 10-Gigabit >> >> SFI/SFP+ Network Connection >> >> pci at 0000 <https://lists.fd.io/mailman/listinfo/vpp-dev>:0b:00.1 >> enp11s0f1 network 82599ES 10-Gigabit >> >> SFI/SFP+ Network Connection >> >> >> >> root at node-4 <https://lists.fd.io/mailman/listinfo/vpp-dev>:~# cat >> /sys/class/net/enp11s0f0/device/sriov_totalvfs >> >> 63 >> >> >> >> root at node-4 <https://lists.fd.io/mailman/listinfo/vpp-dev>:~# cat >> /sys/class/net/enp11s0f0/device/sriov_numvfs >> >> 16 >> >> >> >> root at node-4 <https://lists.fd.io/mailman/listinfo/vpp-dev>:~# ethtool -i >> enp11s0f0 >> >> driver: ixgbe >> >> version: 3.15.1-k >> >> firmware-version: 0x61c10001 >> >> bus-info: 0000:0b:00.0 >> >> supports-statistics: yes >> >> supports-test: yes >> >> supports-eeprom-access: yes >> >> supports-register-dump: yes >> >> supports-priv-flags: no >> >> >> >> /Tomas >> >> >> >> On 12 May 2017 at 10:44, Damjan Marion (damarion) <damarion at cisco.com >> <https://lists.fd.io/mailman/listinfo/vpp-dev>> >> >> wrote: >> >> >> >> > >> >> >* On 12 May 2017, at 08:01, Tomas Brännström <tomas.a.brannstrom at >> >tieto.com <https://lists.fd.io/mailman/listinfo/vpp-dev>>* >> >> >* wrote:* >> >> > >> >> >* *(I forgot to mention before, this is running with VPP installed from* >> >> >* binaries with release .stable.1701)** >> >> > >> >> > >> >> >* I strongly suggest that you use 17.04 release at least.* >> >> > >> >> > >> >> >* With PF do you mean packet filter? I don't think we have any such* >> >> >* configuration. If there is anything else I should provide then please >> >tell* >> >> >* :)* >> >> > >> >> > >> >> >* PF = SR-IOV Physical Function* >> >> > >> >> > >> >> >* I decided to try to attach to the VPP process with gdb and I actually get* >> >> >* a crash when trying to do "ip probe":* >> >> > >> >> >* vpp# ip probe 10.0.1.1 TenGigabitEthernet0/6/0* >> >> >* exec error: Misc* >> >> > >> >> >* Program received signal SIGSEGV, Segmentation fault.* >> >> >* ip4_probe_neighbor (vm=vm at entry >> ><https://lists.fd.io/mailman/listinfo/vpp-dev>=0x7f681533e720 >> ><vlib_global_main>,* >> >> >* dst=dst at entry >> ><https://lists.fd.io/mailman/listinfo/vpp-dev>=0x7f67d345cc50, >> >sw_if_index=sw_if_index at entry >> ><https://lists.fd.io/mailman/listinfo/vpp-dev>=1)* >> >> >* at /w/workspace/vpp-merge-1701-ubuntu1404/build-data/../vnet/* >> >> >* vnet/ip/ip4_forward.c:2223* >> >> >* 2223 >> >/w/workspace/vpp-merge-1701-ubuntu1404/build-data/../vnet/vnet/ip/ip4_forward.c:* >> >> >* No such file or directory.* >> >> >* (gdb) bt* >> >> >* #0 ip4_probe_neighbor (vm=vm at entry >> ><https://lists.fd.io/mailman/listinfo/vpp-dev>=0x7f681533e720 >> ><vlib_global_main>,* >> >> >* dst=dst at entry >> ><https://lists.fd.io/mailman/listinfo/vpp-dev>=0x7f67d345cc50, >> >sw_if_index=sw_if_index at entry >> ><https://lists.fd.io/mailman/listinfo/vpp-dev>=1)* >> >> >* at /w/workspace/vpp-merge-1701-ubuntu1404/build-data/../vnet/* >> >> >* vnet/ip/ip4_forward.c:2223* >> >> > >> >> >* Whether this is related or not I'm not sure because yesterday I could do* >> >> >* the probe but got "Resolution failed". I've attached the stack trace at >> >any* >> >> >* rate.* >> >> > >> >> >* /Tomas* >> >> > >> >> >* On 11 May 2017 at 20:25, Damjan Marion (damarion) <damarion at cisco.com >> ><https://lists.fd.io/mailman/listinfo/vpp-dev>>* >> >> >* wrote:* >> >> > >> >> >>* Dear Tomas,* >> >> >> >> >> >>* Can you please share your PF configuration so I can try to reproduce?* >> >> >> >> >> >>* Thanks,* >> >> >> >> >> >>* Damjan* >> >> >> >> >> >>* On 11 May 2017, at 17:07, Tomas Brännström <tomas.a.brannstrom at >> >>tieto.com <https://lists.fd.io/mailman/listinfo/vpp-dev>>* >> >> >>* wrote:* >> >> >> >> >> >>* Hello* >> >> >>* Since the last mail I sent I've managed to get our test client working* >> >> >>* and VPP running in a KVM VM.* >> >> >> >> >> >>* We are still facing some problems though. We have a two servers, one* >> >> >>* where the virtual machines are running and one we use as the openstack* >> >> >>* controller. They are connected to each other with a 10G NIC. We have >> >>SR-IOV* >> >> >>* configured for the 10G NIC.* >> >> >> >> >> >>* So VPP is installed in a VM, and all interfaces work OK, then can be* >> >> >>* reached from outside the VM etc. Following the basic examples on the >> >>wiki,* >> >> >>* we configure VPP to take over the interfaces:* >> >> >> >> >> >>* vpp# set int ip address TenGigabitEthernet0/6/0 10.0.1.101/24 >> >><http://10.0.1.101/24>* >> >> >>* vpp# set int ip address TenGigabitEthernet0/7/0 10.0.2.101/24 >> >><http://10.0.2.101/24>* >> >> >>* vpp# set int state TenGigabitEthernet0/6/0 up* >> >> >>* vpp# set int state TenGigabitEthernet0/7/0 up* >> >> >> >> >> >>* But when trying to ping for example the physical NIC on the other >> >>server,* >> >> >>* we get no reply:* >> >> >> >> >> >>* vpp# ip probe 10.0.1.1 TenGigabitEthernet0/6/0* >> >> >>* ip probe-neighbor: Resolution failed for 10.0.1.1* >> >> >> >> >> >>* If I do a tcpdump on the physical interface when trying to ping, I see* >> >> >>* ARP packets being sent so -something- is happening, but it seems that* >> >> >>* packets are not correctly arriving to VPP... I can't ping from the >> >>physical* >> >> >>* host either, but the ARP cache is updated on the host when trying to >> >>ping* >> >> >>* from VPP.* >> >> >> >> >> >>* I've tried dumping counters etc. but I can't really see anything. The* >> >> >>* trace does not show anything either. This is the output from "show* >> >> >>* hardware":* >> >> >> >> >> >>* vpp# show hardware* >> >> >>* Name Idx Link Hardware* >> >> >>* TenGigabitEthernet0/6/0 1 up TenGigabitEthernet0/6/0* >> >> >>* Ethernet address fa:16:3e:04:42:d1* >> >> >>* Intel 82599 VF* >> >> >>* carrier up full duplex speed 10000 mtu 9216* >> >> >>* rx queues 1, rx desc 1024, tx queues 1, tx desc 1024* >> >> >> >> >> >>* tx frames ok 3* >> >> >>* tx bytes ok 126* >> >> >>* extended stats:* >> >> >>* tx good packets 3* >> >> >>* tx good bytes 126* >> >> >>* TenGigabitEthernet0/7/0 2 up TenGigabitEthernet0/7/0* >> >> >>* Ethernet address fa:16:3e:f2:15:a5* >> >> >>* Intel 82599 VF* >> >> >>* carrier up full duplex speed 10000 mtu 9216* >> >> >>* rx queues 1, rx desc 1024, tx queues 1, tx desc 1024* >> >> >> >> >> >>* I've tried a similar setup between two virtual box VM's and that worked* >> >> >>* OK, so I'm thinking it might have something to do with SR-IOV for some* >> >> >>* reason. I'm having a hard time troubleshooting this since I'm not sure >> >>how* >> >> >>* to check where the packets actually get lost...* >> >> >> >> >> >>* /Tomas* >> >> >> >> >> >>* _______________________________________________* >> >> >>* vpp-dev mailing list* >> >> >>* vpp-dev at lists.fd.io <https://lists.fd.io/mailman/listinfo/vpp-dev>* >> >> >>* https://lists.fd.io/mailman/listinfo/vpp-dev >> >><https://lists.fd.io/mailman/listinfo/vpp-dev>* >> >> >> >> >> >> >> >> >> >> >> >* <vpp_stacktrace.txt>* >> >> > >> >> > >> >> > >> >> -------------- next part -------------- >> >> An HTML attachment was scrubbed... >> >> URL: >> <http://lists.fd.io/pipermail/vpp-dev/attachments/20170512/5a0abc73/attachment.html> >> >> >> > > _______________________________________________ > vpp-dev mailing list > vpp-dev@lists.fd.io > https://lists.fd.io/mailman/listinfo/vpp-dev > > >
_______________________________________________ vpp-dev mailing list vpp-dev@lists.fd.io https://lists.fd.io/mailman/listinfo/vpp-dev