Re: [PATCH] wc: speed-up by simplifying avx code

2024-03-31 Thread Pádraig Brady
On 31/03/2024 00:18, Evgeny Nizhibitsky wrote: Here is the proposed patch for both simplifying and consistently speeding up the avx version of wc -l by 10% in scenarios of up to 1 billion rows on a 7800X3D (probably should be tested on different data samples and CPUs). The patch was mangled, but I

Re: [PATCH] wc: speed-up by simplifying avx code

2024-03-31 Thread Pádraig Brady
On 31/03/2024 13:12, Pádraig Brady wrote: On 31/03/2024 00:18, Evgeny Nizhibitsky wrote: Here is the proposed patch for both simplifying and consistently speeding up the avx version of wc -l by 10% in scenarios of up to 1 billion rows on a 7800X3D (probably should be tested on different data sample

Re: [PATCH] wc: speed-up by simplifying avx code

2024-03-31 Thread Evgeny Nizhibitsky
Yes, it's true that simplifying and speeding up via the bufsize increase are two different things, although the former allowed the latter. I just landed more tests with hyperfine for various configurations spanning the current master version and a new approach with a range of bufsizes from 16 K

Re: [PATCH] wc: speed-up by simplifying avx code

2024-03-31 Thread Pádraig Brady
On 31/03/2024 18:58, Evgeny Nizhibitsky wrote: Yes, it's true that simplifying and speeding up via the bufsize increase are two different things, although the former allowed the latter. I just landed more tests with hyperfine for various configurations spanning the current master version an

Re: coreutils-9.5 released [stable]

2024-03-31 Thread Richard Stallman
Congratulations on the new release. -- Dr Richard Stallman (https://stallman.org) Chief GNUisance of the GNU Project (https://gnu.org) Founder, Free Software Foundation (https://fsf.org) Internet Hall-of-Famer (https://internethalloffame.org)