After looking at iproute2/tc sources and ovs sources, I believe the error is in ovs, In tc, the burst is calculated with
tc_calc_xmittime(size, rate) = tick_in_us * TIME_UNITS_PER_SEC * (size/rate) tick_in_us seems to be the number of ticks in a microsecond, and TIME_UNITS_PER_SEC is 1000000 In ovs the burst is calculated with tc_bytes_to_ticks(rate, size) = ticks_per_s * (size/rate) and we pass "size" as bits, while it should be bytes. https://github.com/openvswitch/ovs/blob/master/lib/netdev-linux.c#L4736 it should be (kbits_burst * 1024) / 8 instead of kbits_burst*1024 If somebody could doublecheck, that'd be great. On Tue, Apr 12, 2016 at 11:05 AM, Miguel Angel Ajo Pelayo <majop...@redhat.com> wrote: > Hi Ben, > > I think slawe is not worried about the 1000 vs 1024 difference. > > But on the fact that when setting 1000kbit burst on an policy, > when you check via tc cmdline, you see 1000kbyte. > > Is TC command line reporting it wrong?, is it TC API?, or is it a > bug in OVS?. > > I'm reading the TC API and cmdline tool trying to identify that, > but I probably don't have the expertise... we will see.. > > Best regards, > > On Sun, Apr 10, 2016 at 10:25 PM, Ben Pfaff <b...@ovn.org> wrote: >> No, I'm talking about the rest of the comment, please read it again. >> The difference between 1000 vs 1024 bits is a trivial 2.4% difference. >> >> On Sun, Apr 10, 2016 at 09:46:28PM +0200, Sławek Kapłoński wrote: >>> Hello, >>> >>> Thx for anwear and checking that. AFAIU You are talking about issue with SI >>> and IEC units and problem with it. >>> I'm not talking about such issue or not issue. What I pointed here is that >>> (from comment in ovs code) ovs is doing something like: >>> "sbin/tc filter add dev <devname> parent ffff: protocol all prio 49 basic >>> police rate <kbits_rate>kbit burst <kbits_burst>k >>> " >>> please note this <kbit_burst>k >>> but problem is that when You give "k" to tc it is not kilobits but >>> kiloBYTES. >>> So ovs should do something like ".... burst <kbits_burst>kbit" in this >>> command. Or maybe it is intentional to use "k" there so IMHO message in >>> comment should be something like "... burst <kilobyte_burst>k" >>> >>> -- >>> Pozdrawiam / Best regards >>> Sławek Kapłoński >>> sla...@kaplonski.pl >>> >>> Dnia niedziela, 10 kwietnia 2016 12:36:46 CEST Ben Pfaff pisze: >>> > On Thu, Apr 07, 2016 at 09:28:50PM +0200, Sławek Kapłoński wrote: >>> > > Hello, >>> > > >>> > > I'm playing littlebit with ingress policing in openvswitch and I found >>> > > some >>> > > kind of inconsistency. >>> > > In docs and in source code >>> > > (https://github.com/openvswitch/ovs/blob/master/ >>> > > lib/netdev-linux.c#L4698) there is info that burst is set in kilobits, >>> > > but >>> > > I found that in fact it is set in kilobytes in tc. >>> > > So long story short: >>> > > >>> > > root@ubuntu-1604-test:/home/ubuntu# ovs-vsctl show >>> > > f0d1f90d-174f-47bf-89bc-bf37f2da0271 >>> > > >>> > > Bridge "br1" >>> > > >>> > > Port vethA >>> > > >>> > > Interface vethA >>> > > >>> > > Port "br1" >>> > > >>> > > Interface "br1" >>> > > >>> > > type: internal >>> > > >>> > > ovs_version: "2.5.0" >>> > > >>> > > root@ubuntu-1604-test:/home/ubuntu# ovs-vsctl set Interface vethA >>> > > ingress_policing_rate=1000 -- set Interface vethA >>> > > ingress_policing_burst=100 >>> > > >>> > > root@ubuntu-1604-test:/home/ubuntu# ovs-vsctl list interface vethA | >>> > > grep >>> > > ingress >>> > > ingress_policing_burst: 100 >>> > > ingress_policing_rate: 1000 >>> > > >>> > > >>> > > results in: >>> > > root@ubuntu-1604-test:/home/ubuntu# tc filter show dev vethA parent >>> > > ffff: >>> > > filter protocol all pref 49 basic >>> > > filter protocol all pref 49 basic handle 0x1 >>> > > >>> > > police 0x1 rate 1Mbit burst 100Kb mtu 64Kb action drop overhead 0b >>> > > >>> > > ref 1 bind 1 >>> > > >>> > > >>> > > As You can see above, burst is given in "Kb" units what, according to tc >>> > > man (http://lartc.org/manpages/tc.txt) means kilobytes. >>> > > >>> > > So question: is it intentional inconsistency or bug? If bug then where: >>> > > in >>> > > code or in docs? >>> > >>> > We applied a fix to the policing code last year, in the following >>> > commit. If you read the long comment that it adds, and then compare >>> > that with what you're saying, it sounds like tc once had a bug where it >>> > confused bits and bytes, and we modified OVS so that it had the same >>> > bug. Presumably, now tc's bug has been fixed, and so we should fix OVS >>> > the same way. >>> > >>> > --8<--------------------------cut here-------------------------->8-- >>> > >>> > From: Ben Pfaff <b...@nicira.com> >>> > Date: Fri, 13 Mar 2015 11:30:18 -0700 >>> > Subject: [PATCH] netdev-linux: Be more careful about integer overflow in >>> > policing. >>> > >>> > Otherwise the policing limits could make no sense if large rates were >>> > specified. >>> > >>> > Reported-by: Zhangguanghui <zhang.guang...@h3c.com> >>> > Signed-off-by: Ben Pfaff <b...@nicira.com> >>> > Acked-by: Ansis Atteka <aatt...@nicira.com> >>> > --- >>> > AUTHORS | 1 + >>> > lib/netdev-linux.c | 29 ++++++++++++++++++++++------- >>> > vswitchd/bridge.c | 4 ++-- >>> > 3 files changed, 25 insertions(+), 9 deletions(-) >>> > >>> > diff --git a/AUTHORS b/AUTHORS >>> > index fe79acd..d7925db 100644 >>> > --- a/AUTHORS >>> > +++ b/AUTHORS >>> > @@ -345,6 +345,7 @@ Voravit T. vora...@kth.se >>> > Yeming Zhao zhaoyem...@gmail.com >>> > Ying Chen yingc...@vmware.com >>> > Yongqiang Liu liuyq7...@gmail.com >>> > +Zhangguanghui zhang.guang...@h3c.com >>> > Ziyou Wang ziy...@vmware.com >>> > Zoltán Balogh zoltan.bal...@ericsson.com >>> > ankur dwivedi ankurengg2...@gmail.com >>> > diff --git a/lib/netdev-linux.c b/lib/netdev-linux.c >>> > index 662ccc9..6e574cd 100644 >>> > --- a/lib/netdev-linux.c >>> > +++ b/lib/netdev-linux.c >>> > @@ -1,5 +1,5 @@ >>> > /* >>> > - * Copyright (c) 2009, 2010, 2011, 2012, 2013, 2014 Nicira, Inc. >>> > + * Copyright (c) 2009, 2010, 2011, 2012, 2013, 2014, 2015 Nicira, Inc. >>> > * >>> > * Licensed under the Apache License, Version 2.0 (the "License"); >>> > * you may not use this file except in compliance with the License. >>> > @@ -399,8 +399,8 @@ static struct tcmsg *tc_make_request(const struct >>> > netdev >>> > *, int type, unsigned int flags, struct ofpbuf *); static int >>> > tc_transact(struct ofpbuf *request, struct ofpbuf **replyp); static int >>> > tc_add_del_ingress_qdisc(struct netdev *netdev, bool add); -static int >>> > tc_add_policer(struct netdev *netdev, int kbits_rate, >>> > - int kbits_burst); >>> > +static int tc_add_policer(struct netdev *, >>> > + uint32_t kbits_rate, uint32_t kbits_burst); >>> > >>> > static int tc_parse_qdisc(const struct ofpbuf *, const char **kind, >>> > struct nlattr **options); >>> > @@ -4028,12 +4028,13 @@ tc_add_del_ingress_qdisc(struct netdev *netdev, >>> > bool >>> > add) * mtu 65535 drop >>> > * >>> > * The configuration and stats may be seen with the following command: >>> > - * /sbin/tc -s filter show <devname> eth0 parent ffff: >>> > + * /sbin/tc -s filter show dev <devname> parent ffff: >>> > * >>> > * Returns 0 if successful, otherwise a positive errno value. >>> > */ >>> > static int >>> > -tc_add_policer(struct netdev *netdev, int kbits_rate, int kbits_burst) >>> > +tc_add_policer(struct netdev *netdev, >>> > + uint32_t kbits_rate, uint32_t kbits_burst) >>> > { >>> > struct tc_police tc_police; >>> > struct ofpbuf request; >>> > @@ -4047,8 +4048,22 @@ tc_add_policer(struct netdev *netdev, int >>> > kbits_rate, >>> > int kbits_burst) tc_police.action = TC_POLICE_SHOT; >>> > tc_police.mtu = mtu; >>> > tc_fill_rate(&tc_police.rate, ((uint64_t) kbits_rate * 1000)/8, mtu); >>> > - tc_police.burst = tc_bytes_to_ticks(tc_police.rate.rate, >>> > - kbits_burst * 1024); >>> > + >>> > + /* The following appears wrong in two ways: >>> > + * >>> > + * - tc_bytes_to_ticks() should take "bytes" as quantity for both of >>> > its + * arguments (or at least consistently "bytes" as both or >>> > "bits" >>> > as + * both), but this supplies bytes for the first argument and >>> > bits >>> > for the + * second. >>> > + * >>> > + * - In networking a kilobit is usually 1000 bits but this uses 1024 >>> > bits. + * >>> > + * However if you "fix" those problems then "tc filter show ..." >>> > shows >>> > + * "125000b", meaning 125,000 bits, when OVS configures it for 1000 >>> > kbit == + * 1,000,000 bits, whereas this actually ends up doing the >>> > right thing from + * tc's point of view. Whatever. */ >>> > + tc_police.burst = tc_bytes_to_ticks( >>> > + tc_police.rate.rate, MIN(UINT32_MAX / 1024, kbits_burst) * 1024); >>> > >>> > tcmsg = tc_make_request(netdev, RTM_NEWTFILTER, >>> > NLM_F_EXCL | NLM_F_CREATE, &request); >>> > diff --git a/vswitchd/bridge.c b/vswitchd/bridge.c >>> > index 85bbfa3..68648b9 100644 >>> > --- a/vswitchd/bridge.c >>> > +++ b/vswitchd/bridge.c >>> > @@ -4445,8 +4445,8 @@ iface_configure_qos(struct iface *iface, const >>> > struct >>> > ovsrec_qos *qos) } >>> > >>> > netdev_set_policing(iface->netdev, >>> > - iface->cfg->ingress_policing_rate, >>> > - iface->cfg->ingress_policing_burst); >>> > + MIN(UINT32_MAX, >>> > iface->cfg->ingress_policing_rate), >>> > + MIN(UINT32_MAX, >>> > iface->cfg->ingress_policing_burst)); >>> > >>> > ofpbuf_uninit(&queues_buf); >>> > } >> >> _______________________________________________ dev mailing list dev@openvswitch.org http://openvswitch.org/mailman/listinfo/dev