Re: [PATCH v5 06/10] util/bufferiszero: Improve scalar variant

2024-02-17 Thread Richard Henderson
On 2/17/24 02:13, Alexander Monakov wrote: On Fri, 16 Feb 2024, Richard Henderson wrote: Split less-than and greater-than 256 cases. Use unaligned accesses for head and tail. Avoid using out-of-bounds pointers in loop boundary conditions. I guess it did not carry typedef uint64_t uint64_

Re: [PATCH v5 06/10] util/bufferiszero: Improve scalar variant

2024-02-17 Thread Alexander Monakov
On Fri, 16 Feb 2024, Richard Henderson wrote: > Split less-than and greater-than 256 cases. > Use unaligned accesses for head and tail. > Avoid using out-of-bounds pointers in loop boundary conditions. I guess it did not carry typedef uint64_t uint64_a __attribute__((may_alias)); along the

[PATCH v5 06/10] util/bufferiszero: Improve scalar variant

2024-02-16 Thread Richard Henderson
Split less-than and greater-than 256 cases. Use unaligned accesses for head and tail. Avoid using out-of-bounds pointers in loop boundary conditions. Signed-off-by: Richard Henderson --- util/bufferiszero.c | 86 +++-- 1 file changed, 52 insertions(+), 34