On Tue, Jan 14, 2025 at 12:59:04AM -0500, Tom Lane wrote: > John Naylor <johncnaylo...@gmail.com> writes: >> We can do about as well simply by changing the nibble lookup to a byte >> lookup, which works on every compiler and architecture:
Nice. I tried enabling auto-vectorization and loop unrolling on top of this patch, and the numbers looked the same. I think we'd need CPU intrinsics or an even bigger lookup table to do any better. > I didn't attempt to verify your patch, but I do prefer addressing > this issue in a machine-independent fashion. I also like the brevity > of the patch (though it could do with some comments perhaps, not that > the existing code has any). +1 -- nathan