Hi, On 2025-03-05 12:15:57 -0800, Jeff Davis wrote: > On Wed, 2025-03-05 at 11:33 -0800, James Hunter wrote: > > For a bitfield, however, the CPU has to read from or write to the > > byte > > that contains the bit, but then it also has to mask out the *other* > > bits in that bitfield. This is a data dependency, so it stalls the > > CPU > > pipeline. > > Here the bits aren't changing, so we're only talking about mask-and- > test, right? My intuition is that wouldn't cause much of a problem.
FWIW, I am fairly certain that I looked at this at an earlier state of the patch, and at least for me the issue wasn't that it was inherently slower to use the bitmask, but that it was hard to convince the compiler not generate worse code. IIRC the compiler generated more complicated address gathering instructions which are slower on some older microarchitectures, but this is a vague memory. Greetings, Andres