On Thu, Nov 24, 2011 at 08:19:39AM -0500, Jamal Hadi Salim wrote: > > I assume you mean something like setup_tc() which we have > > today to call into into the driver at qdisc create time. This > > happens with the RTNL held. I don't see any reason not to also > > call into the hardware on qdisc_change() I just haven't done > > it yet. > > Yes, the operative piece is "also". In other words, I should be > able to run tc qdisc blah and not see the difference. > In the distant past what i have done in the case of absence of software > support is to write the "hwardware" scheduler in the kernel. If we > already have the hardware support, then there is no need for that step. > Let tc be responsible for controlling this "hardware" qdisc. It doesnt > talk to the hardware. > A user space helper app listens to things being added and deleted by > tc in the kernel and synchronizes them via a driver-specific call. > Different drivers tend to have different lower layer "hard-coded" > ways of setting up the hardware; so you may end up with different > backends. > The challenge is synchronizing stats. > > > Although I'm pretty sure we don't want to add a new ndo_ops > > ever time we have some hardware feature we want to expose. > > Assuming there are more than 1 or 2 hw features. So maybe > > we could convert to something more generic. A setup_qos() > > call that passes an skb with nl attributes. > > You only need one - call it "hardware_setup" so you can do > other esoteric things with it. > > > Is that what you were asking? > > Something like that. I described how i did it - but thats because > I wanted to make zero changes to the kernel. It is better to have > kernel support of some sort but you dont want to do too much > otherwise you start adding a lot of shit in the kernel like > the infiniband guys. Have a user space helper when in doubt. > I almost forgot, a good example (of good work in the kernel already) > you wanna take a look at is something Lennert (added to CC) did for > Marvel chips (i think it is called DSA).
The problem that net/dsa/ tries to solve is that of managing multi-port hardware ethernet switch chips (such as those found in wifi routers and such). The basic idea was to expose each port on the switch chip as a separate Linux netdev, and to mirror the Linux networking config into the switch chip, to enable offloading of as many tasks as possible to the hardware. E.g., adding two of the ports on the switch to the same bridge port group with brctl should program the switch chip to use the same address and VLAN database for the two ports, and enable forwarding of packets in hardware. A working-but-not-very-clean implementation of this is at: http://patchwork.ozlabs.org/patch/16578/ (And things like enabling promiscuous mode on a subinterface can be emulated by enabling port mirroring from the given port to the CPU port, etc.) There's a bunch of features that the hardware supports that have no analog in the Linux networking stack (e.g. port mirroring a non-CPU port to another non-CPU port), which is similar to your scenario, I guess. For those, we mostly end up with some ad-hoc sysfs interface or so, which is partly because there probably isn't enough interest in having a generic way of doing this in the upstream kernel. _______________________________________________ dev mailing list dev@openvswitch.org http://openvswitch.org/mailman/listinfo/dev