Re: [FFmpeg-devel] [PATCH] swscale/aarch64: dotprod implementation of rgba32_to_Y

2025-03-01 Thread Martin Storsjö
On Thu, 27 Feb 2025, Krzysztof Pyrkosz via ffmpeg-devel wrote: --- I was curious whether it's possible to implement this function without any widening, and it turns out it not only is, but it's quite performant at the same time! The idea is to split the 16 bit coefficients into lower and upper

Re: [FFmpeg-devel] [PATCH] swscale/aarch64: dotprod implementation of rgba32_to_Y

2025-02-28 Thread Niklas Haas
On Fri, 28 Feb 2025 11:49:53 +0100 Andreas Rheinhardt wrote: > Niklas Haas: > > On Fri, 28 Feb 2025 10:31:19 +0800 Zhao Zhili > > wrote: > >> Cc haasn. > >> > >> Libswscale in under refactor. Does current asm works after refactor, or > >> they need to be refactored or > >> rewrite after? If it

Re: [FFmpeg-devel] [PATCH] swscale/aarch64: dotprod implementation of rgba32_to_Y

2025-02-28 Thread Andreas Rheinhardt
Niklas Haas: > On Fri, 28 Feb 2025 10:31:19 +0800 Zhao Zhili wrote: >> Cc haasn. >> >> Libswscale in under refactor. Does current asm works after refactor, or they >> need to be refactored or >> rewrite after? If it’s the second case, maybe we should hold on to do more >> asm with libswscale >>

Re: [FFmpeg-devel] [PATCH] swscale/aarch64: dotprod implementation of rgba32_to_Y

2025-02-28 Thread Martin Storsjö
On Fri, 28 Feb 2025, Niklas Haas wrote: On Fri, 28 Feb 2025 10:31:19 +0800 Zhao Zhili wrote: Cc haasn. Libswscale in under refactor. Does current asm works after refactor, or they need to be refactored or rewrite after? If it’s the second case, maybe we should hold on to do more asm with li

Re: [FFmpeg-devel] [PATCH] swscale/aarch64: dotprod implementation of rgba32_to_Y

2025-02-28 Thread Niklas Haas
On Fri, 28 Feb 2025 10:31:19 +0800 Zhao Zhili wrote: > Cc haasn. > > Libswscale in under refactor. Does current asm works after refactor, or they > need to be refactored or > rewrite after? If it’s the second case, maybe we should hold on to do more > asm with libswscale > before hassn work done

Re: [FFmpeg-devel] [PATCH] swscale/aarch64: dotprod implementation of rgba32_to_Y

2025-02-27 Thread Zhao Zhili
Cc haasn. Libswscale in under refactor. Does current asm works after refactor, or they need to be refactored or rewrite after? If it’s the second case, maybe we should hold on to do more asm with libswscale before hassn work done. > On Feb 28, 2025, at 06:44, Krzysztof Pyrkosz via ffmpeg-devel