On Thu, 2025-03-06 at 01:07 +1300, David Rowley wrote: > I've attached the results. The 3990x with clang looks good, but the > rest are mostly slower.
I am still curious why. If it's due to compiler misoptimization, is that kind of thing often misoptimized, or is there something we're doing in particular? Even if we don't have answers, it might be worth adding a brief comment that we empirically determined that booleans are faster than bitfields or flags. In the future, maybe compilers mostly get this right, and we want to change to bitfields. Regards, Jeff Davis