Re: [FFmpeg-devel] [PATCH] avutil/aarch64/tx_float_neon.S: clean up FFT4_X2

2025-02-28 Thread Lynne
On 25/02/2025 21:45, Krzysztof Pyrkosz via ffmpeg-devel wrote: --- This patch rids the code of two tbl instructions and the shuffle table. Unfortunately there's no fneg v0.s[3] instruction, so I negate the whole vector and copy only the last element. It's tricky to benchmark this little change

[FFmpeg-devel] [PATCH] avutil/aarch64/tx_float_neon.S: clean up FFT4_X2

2025-02-25 Thread Krzysztof Pyrkosz via ffmpeg-devel
--- This patch rids the code of two tbl instructions and the shuffle table. Unfortunately there's no fneg v0.s[3] instruction, so I negate the whole vector and copy only the last element. It's tricky to benchmark this little change, but on average it seems to be beneficial. Krzysztof libavutil
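
For readers who want to see the "negate the whole vector and copy the last element only" idea spelled out, here is a minimal scalar sketch in C. It is not the patch itself: the `vec4f` type and the helpers `fneg_4s`, `ins_lane` and `negate_last_lane` are hypothetical names made up for illustration, and the exact registers and instruction order used in tx_float_neon.S are not shown in this thread. The sketch only assumes the two A64 operations the description implies, a full-vector `fneg` and a single-lane `mov` (INS element).

```c
#include <stddef.h>

/* Hypothetical scalar model of one 128-bit NEON register viewed as .4s. */
typedef struct { float s[4]; } vec4f;

/* Models `fneg vD.4s, vN.4s`: negates every lane, there is no per-lane form. */
static vec4f fneg_4s(vec4f v)
{
    vec4f r;
    for (size_t i = 0; i < 4; i++)
        r.s[i] = -v.s[i];
    return r;
}

/* Models `mov vD.s[lane], vN.s[lane]` (INS element): copies one lane only. */
static vec4f ins_lane(vec4f dst, vec4f src, size_t lane)
{
    dst.s[lane] = src.s[lane];
    return dst;
}

/* Negate only lane 3: negate a full copy, then copy lane 3 back into v. */
static vec4f negate_last_lane(vec4f v)
{
    vec4f neg = fneg_4s(v);      /* scratch register holds -v in all lanes */
    return ins_lane(v, neg, 3);  /* lanes 0..2 keep their original values */
}
```

Calling `negate_last_lane` on the lanes {1, 2, 3, 4} leaves the first three lanes untouched and flips the sign of the last one. The sequence costs two simple instructions and a scratch register, and it needs no table load or `tbl` lookup, which is the trade-off the patch description refers to.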