jamal wrote, On 11/03/2007 12:23 AM:

> On Fri, 2007-02-11 at 18:31 +0100, Jarek Poplawski wrote:
>> Radu Rendec wrote:
>>
>>> Hi,
>>>
>>> While trying to implement u32 hashes in my shaping machine I ran into a
>>> possible bug in the u32 hash/bucket computing algorithm
>>> (net/sched/cls_u32.c).
>>>
>>> The problem occurs only with hash masks that extend over the octet
>>> boundary, on little endian machines (where htonl() actually does
>>> something).
>>>
>>> I'm not 100% sure this is a problem with u32 itself, but at least I'm
>>> sure u32 with the same configuration would behave differently on little
>>> endian and big endian machines. Detailed description of the problem and
>>> proposed patch follow.
>>
>> I think you are right about this different behavior, so it looks like a bug.
>> And since little endian way is uncontrollable in such a case, your proposal
>> should be right.
>>
>> But, since there is a maintainer for this, let's check what is he not payed
>> for?! (Cc: Jamal Hadi Salim)
>>
> 
> Thanks for the CC Jarek - and i promise to share the loot with you when
> i lay my hands on it;->
> 
> I see that given the mask described (the 0 bits bounding the two
> nibbles), the same packet in that network will hit two different buckets
> depending on endianness. In other words there is lack of consistency. So
> good catch.
> The patch would certainly resolve it.
> The only thing that bothers me with the patch approach is the extra
> conversion in the fast path. Radu, since this is not a show stopper -
> can you give me a short time to sip on it? I am thinking it is probably
> resolvable by using the right tuning at config time - one knob that
> looks usable is fshift and that all this can be done at config time; but
> i may need more than one coffee to get it right, but if you see it just
> send a patch. I will try to use the data you used to see if i am making
> any sense.
> 
> cheers,
> jamal
> 
> -
> To unsubscribe from this list: send the line "unsubscribe netdev" in
> the body of a message to [EMAIL PROTECTED]
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 


static __inline__ unsigned u32_hash_fold(u32 key, struct tc_u32_sel *sel, u8 
fshift)
{
#ifdef __LITTLE_ENDIAN
        /*
         * It's a hack/optimization to avoid ntohl(). Since
         * it's only used in u32_classify(), and masked with
         * divisor (1 byte), one, least signinificant byte
         * of selection is enough.
         */
        fshift = 32 - 8 - fshift;
#endif
        unsigned h = (key & sel->hmask) >> fshift;

        return h;
}




> --- linux-2.6.22.9/net/sched/cls_u32.c.orig   2007-10-30 17:08:03.000000000 
> +0200
> +++ linux-2.6.22.9/net/sched/cls_u32.c        2007-10-30 17:04:49.000000000 
> +0200
> @@ -198,7 +198,7 @@
>               ht = n->ht_down;
>               sel = 0;
>               if (ht->divisor)
> -                     sel = 
> ht->divisor&u32_hash_fold(*(u32*)(ptr+n->sel.hoff), &n->sel,n->fshift);
> +                     sel = 
> ht->divisor&u32_hash_fold(ntohl(*(u32*)(ptr+n->sel.hoff)), &n->sel,n->fshift);
>  
>               if (!(n->sel.flags&(TC_U32_VAROFFSET|TC_U32_OFFSET|TC_U32_EAT)))
>                       goto next_ht;
> @@ -626,6 +626,10 @@
>       }
>  #endif
>  
> +     /* userspace tc tool sends us the hmask in network order, but we
> +      * need host order, so change it here */
> +     s->hmask = ntohl(s->hmask);
> +
>       memcpy(&n->sel, s, sizeof(*s) + s->nkeys*sizeof(struct tc_u32_key));
>       n->ht_up = ht;
>       n->handle = handle;
> @@ -735,9 +739,14 @@
>               u32 divisor = ht->divisor+1;
>               RTA_PUT(skb, TCA_U32_DIVISOR, 4, &divisor);
>       } else {
> +             /* get the address where the selector will be put, then
> +              * change the hmask after it is put there */
> +             struct tc_u32_sel *s =
> +                     (struct tc_u32_sel *)RTA_DATA(skb_tail_pointer(skb));
>               RTA_PUT(skb, TCA_U32_SEL,
>                       sizeof(n->sel) + n->sel.nkeys*sizeof(struct tc_u32_key),
>                       &n->sel);
> +             s->hmask = htonl(s->hmask);
>               if (n->ht_up) {
>                       u32 htid = n->handle & 0xFFFFF000;
>                       RTA_PUT(skb, TCA_U32_HASH, 4, &htid);
> 
> 
> -
-
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to