On Mon, 2015-01-12 at 11:55 +1100, Anton Blanchard wrote:
> Hi David,
>
> > The unrolled loop (deleted) looks excessive.
> > On a modern cpu with multiple execution units you can usually
> > manage to get the loop overhead to execute in parallel to the
> > actual 'work'.
> > So I suspect that a m
Hi Folks,
I'd like to ask for some advice, if this list is still active. I’m
supporting a legacy product and looking to improve its networking performance. It
is a PPC460GT used as a bridge device with a 1Gb Ethernet interface (Marvell) that
routes packets to a PCI network card (using igb).
Going in th
Hi David,
> The unrolled loop (deleted) looks excessive.
> On a modern cpu with multiple execution units you can usually
> manage to get the loop overhead to execute in parallel to the
> actual 'work'.
> So I suspect that a much simpler 'word at a time' loop will be
> almost as fast - especially i