Hi Kristoffer,

Sorry for late response.

On Tue, Jul 29, 2014 at 4:30 PM, Kristoffer Egefelt <kristof...@itoc.dk> wrote:
> I actually asked them:
> http://lists.opencontrail.org/pipermail/users_lists.opencontrail.org/2014-July/000338.html
>
> As I understood it, they are able to offload from VM to VM, without 
> segmentation - maybe I’m wroing, and what actually is happening is that their 
> vrouter module, on the receiving side, is able to reassemble the packets to 
> 64k before sending to the vif.
> This reassembly I cannot reproduce with OVS.
> Maybe this is how they reach the 9.18Gbps ?
>

In fact the optimization I mentioned to VXLAN was similar in the way
you described.
Earlier discussion in OVS community:
http://openvswitch.org/pipermail/discuss/2014-May/013981.html

The OVS patch in my Github:
https://github.com/hzhou8/openvswitch/commit/9a7deb8b432ce83a9c09d7d4ff85fa050f7dd2be

RFC draft:
http://tools.ietf.org/html/draft-zhou-li-vxlan-soe-01

However this cannot achieve 9+ Gbits/sec for a single flow. It was
around 2.5 times increase from 2.6G to 6.7G.
Linux kernel had an optimization for GRO and GSO for UDP tunnels,
VXLAN is one use case. Should be similar in GRE tunnels. The approach
is different but performance gains are similar, because all these
solutions are to reassemble in receiving host to avoid small packet to
RX VMs.

Back to the OpenContrail case, I think they might be using multiple
flows instead of one to achieve 9+ Gbits/sec. Without hardware
acceleration (disable TSO/GRO on NIC), even with baremetal we can only
achieve 6+ Gbits/sec with a single flow. So I don't believe the
software offloading of vRouter can compete NIC acceleration.

Best regards,
Han
_______________________________________________
discuss mailing list
discuss@openvswitch.org
http://openvswitch.org/mailman/listinfo/discuss

Reply via email to