[FFmpeg-devel] [PATCH v2 5/6] lavc/hevc: R-V V put_pixels(pow2)

2024-11-30 Thread uk7b-at-foxmail . com
From: sunyuechi k230 banana_f3 put_hevc_pel_pixels4_8_c: 61.6 ( 1.00x)69.5 ( 1.00x) put_hevc_pel_pixels4_8_rvv_i32: 24.6 ( 2.50x)28.0 ( 2.48x) put_hevc_pel_pixels8_8_c: 209.8 ( 1.00x)215.5 ( 1.00x)

[FFmpeg-devel] [PATCH v2 4/6] lavc/vvc_mc: R-V V put_pixels

2024-11-30 Thread uk7b-at-foxmail . com
From: sunyuechi k230 banana_f3 put_chroma_pixels_8_4x4_c: 63.5 ( 1.00x)59.2 ( 1.00x) put_chroma_pixels_8_4x4_rvv_i32:26.5 ( 2.39x)28.0 ( 2.12x) put_chroma_pixels_8_8

[FFmpeg-devel] [PATCH v2 3/6] lavc/riscv: Move VVC macro to h26x

2024-11-30 Thread uk7b-at-foxmail . com
From: sunyuechi --- libavcodec/riscv/h26x/asm.S | 127 ++ libavcodec/riscv/vvc/vvc_mc_rvv.S | 117 ++- 2 files changed, 132 insertions(+), 112 deletions(-) create mode 100644 libavcodec/riscv/h26x/asm.S diff --git a/libavcodec/riscv/h26

[FFmpeg-devel] [PATCH v2 2/6] lavc/vvc_mc: R-V V dmvr

2024-11-30 Thread uk7b-at-foxmail . com
From: sunyuechi k230 banana_f3 dmvr_8_12x20_c: 619.3 ( 1.00x)624.1 ( 1.00x) dmvr_8_12x20_rvv_i32: 128.6 ( 4.82x)103.4 ( 6.04x) dmvr_8_20x12_c: 610.0 ( 1.00x)665.6 ( 1.00x) dm

[FFmpeg-devel] [PATCH v2 1/6] Update R-V V vvc_mc vset to support more lengths

2024-11-30 Thread uk7b-at-foxmail . com
From: sunyuechi --- libavcodec/riscv/vvc/vvc_mc_rvv.S | 46 +++ 1 file changed, 23 insertions(+), 23 deletions(-) diff --git a/libavcodec/riscv/vvc/vvc_mc_rvv.S b/libavcodec/riscv/vvc/vvc_mc_rvv.S index 45f4750f82..18532616d9 100644 --- a/libavcodec/riscv/vvc/vvc_mc

[FFmpeg-devel] [PATCH v2 6/6] lavc/vvc_mc R-V V sad

2024-11-30 Thread uk7b-at-foxmail . com
From: sunyuechi k230 banana_f3 sad_8x16_c: 387.7 ( 1.00x)394.9 ( 1.00x) sad_8x16_rvv_i32: 109.7 ( 3.53x)103.5 ( 3.82x) sad_16x8_c: 378.2 ( 1.00x)384.7 ( 1.00x) sad_16x8_rvv_i32:82.0 ( 4.61x

[FFmpeg-devel] [PATCH 1/2] checkasm/rv40dsp: cover more cases for rv40_bias

2024-12-04 Thread uk7b-at-foxmail . com
From: sunyuechi --- tests/checkasm/rv40dsp.c | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/tests/checkasm/rv40dsp.c b/tests/checkasm/rv40dsp.c index a1a873d430..0600b07d09 100644 --- a/tests/checkasm/rv40dsp.c +++ b/tests/checkasm/rv40dsp.c @@ -47,8 +47,8 @@ static void

[FFmpeg-devel] [PATCH 2/2] lavc/rv40dsp: fix RISC-V chroma_mc

2024-12-04 Thread uk7b-at-foxmail . com
From: sunyuechi --- libavcodec/riscv/rv40dsp_rvv.S | 116 ++--- 1 file changed, 78 insertions(+), 38 deletions(-) diff --git a/libavcodec/riscv/rv40dsp_rvv.S b/libavcodec/riscv/rv40dsp_rvv.S index ca431eb8ab..d4beb7f1e9 100644 --- a/libavcodec/riscv/rv40dsp_rvv.S +++

[FFmpeg-devel] [PATCH v2 1/2] checkasm/rv40dsp: cover more cases

2024-12-04 Thread uk7b-at-foxmail . com
From: sunyuechi --- tests/checkasm/rv40dsp.c | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/tests/checkasm/rv40dsp.c b/tests/checkasm/rv40dsp.c index a1a873d430..8d2252116c 100644 --- a/tests/checkasm/rv40dsp.c +++ b/tests/checkasm/rv40dsp.c @@ -27,7 +27,7 @@ #define

[FFmpeg-devel] [PATCH v2 2/2] lavc/rv40dsp: fix RISC-V chroma_mc

2024-12-04 Thread uk7b-at-foxmail . com
From: sunyuechi --- libavcodec/riscv/rv40dsp_rvv.S | 116 ++--- 1 file changed, 78 insertions(+), 38 deletions(-) diff --git a/libavcodec/riscv/rv40dsp_rvv.S b/libavcodec/riscv/rv40dsp_rvv.S index ca431eb8ab..d4beb7f1e9 100644 --- a/libavcodec/riscv/rv40dsp_rvv.S +++

[FFmpeg-devel] [PATCH v2_2 3/6] lavc/riscv: Move VVC macro to h26x

2024-12-15 Thread uk7b-at-foxmail . com
From: sunyuechi --- libavcodec/riscv/h26x/asm.S | 127 ++ libavcodec/riscv/vvc/vvc_mc_rvv.S | 117 ++- 2 files changed, 132 insertions(+), 112 deletions(-) create mode 100644 libavcodec/riscv/h26x/asm.S diff --git a/libavcodec/riscv/h26

[FFmpeg-devel] [PATCH v2_2 2/6] lavc/vvc_mc: R-V V dmvr

2024-12-15 Thread uk7b-at-foxmail . com
From: sunyuechi k230 banana_f3 dmvr_8_12x20_c: 619.3 ( 1.00x)624.1 ( 1.00x) dmvr_8_12x20_rvv_i32: 128.6 ( 4.82x)103.4 ( 6.04x) dmvr_8_20x12_c: 610.0 ( 1.00x)665.6 ( 1.00x) dm

[FFmpeg-devel] [PATCH v2_2 1/6] Update R-V V vvc_mc vset to support more lengths

2024-12-15 Thread uk7b-at-foxmail . com
From: sunyuechi --- libavcodec/riscv/vvc/vvc_mc_rvv.S | 46 +++ 1 file changed, 23 insertions(+), 23 deletions(-) diff --git a/libavcodec/riscv/vvc/vvc_mc_rvv.S b/libavcodec/riscv/vvc/vvc_mc_rvv.S index 45f4750f82..18532616d9 100644 --- a/libavcodec/riscv/vvc/vvc_mc

[FFmpeg-devel] [PATCH v2_2 6/6] lavc/vvc_mc R-V V sad

2024-12-15 Thread uk7b-at-foxmail . com
From: sunyuechi k230 banana_f3 sad_8x16_c: 387.7 ( 1.00x)394.9 ( 1.00x) sad_8x16_rvv_i32: 109.7 ( 3.53x)103.5 ( 3.82x) sad_16x8_c: 378.2 ( 1.00x)384.7 ( 1.00x) sad_16x8_rvv_i32:82.0 ( 4.61x

[FFmpeg-devel] [PATCH v2_2 4/6] lavc/vvc_mc: R-V V put_pixels

2024-12-15 Thread uk7b-at-foxmail . com
From: sunyuechi k230 banana_f3 put_chroma_pixels_8_4x4_c: 63.5 ( 1.00x)59.2 ( 1.00x) put_chroma_pixels_8_4x4_rvv_i32:26.5 ( 2.39x)28.0 ( 2.12x) put_chroma_pixels_8_8

[FFmpeg-devel] [PATCH v2_2 5/6] lavc/hevc: R-V V put_pixels(pow2)

2024-12-15 Thread uk7b-at-foxmail . com
From: sunyuechi k230 banana_f3 put_hevc_pel_pixels4_8_c: 61.6 ( 1.00x)69.5 ( 1.00x) put_hevc_pel_pixels4_8_rvv_i32: 24.6 ( 2.50x)28.0 ( 2.48x) put_hevc_pel_pixels8_8_c: 209.8 ( 1.00x)215.5 ( 1.00x)

[FFmpeg-devel] [PATCH] lavc/riscv: vset macro for simplify if-else

2024-12-16 Thread uk7b-at-foxmail . com
From: sunyuechi --- libavcodec/riscv/h26x/asm.S | 78 + 1 file changed, 18 insertions(+), 60 deletions(-) diff --git a/libavcodec/riscv/h26x/asm.S b/libavcodec/riscv/h26x/asm.S index d37b459f66..d99690d9a0 100644 --- a/libavcodec/riscv/h26x/asm.S +++ b/libavc

[FFmpeg-devel] [PATCH v3 2/2] lavc/rv40dsp: fix RISC-V chroma_mc

2024-12-05 Thread uk7b-at-foxmail . com
From: sunyuechi --- libavcodec/riscv/rv40dsp_rvv.S | 116 ++--- 1 file changed, 78 insertions(+), 38 deletions(-) diff --git a/libavcodec/riscv/rv40dsp_rvv.S b/libavcodec/riscv/rv40dsp_rvv.S index ca431eb8ab..d4beb7f1e9 100644 --- a/libavcodec/riscv/rv40dsp_rvv.S +++

[FFmpeg-devel] [PATCH v3 1/2] checkasm/rv40dsp: cover more cases

2024-12-05 Thread uk7b-at-foxmail . com
From: sunyuechi Co-Authored-By: Ronald S. Bultje --- tests/checkasm/rv40dsp.c | 10 +- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/tests/checkasm/rv40dsp.c b/tests/checkasm/rv40dsp.c index a1a873d430..c0d02ec81f 100644 --- a/tests/checkasm/rv40dsp.c +++ b/tests/checkas

[FFmpeg-devel] [PATCH] lavc/vvc_mc: reduce sequential dependency in R-V V sad

2024-12-21 Thread uk7b-at-foxmail . com
From: sunyuechi --- libavcodec/riscv/vvc/vvc_sad_rvv.S | 10 +- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/libavcodec/riscv/vvc/vvc_sad_rvv.S b/libavcodec/riscv/vvc/vvc_sad_rvv.S index 341167be1f..f325deee17 100644 --- a/libavcodec/riscv/vvc/vvc_sad_rvv.S +++ b/libavc

[FFmpeg-devel] [PATCH] Fix the tail handling in R-V V sad

2024-12-23 Thread uk7b-at-foxmail . com
From: sunyuechi --- libavcodec/riscv/h26x/asm.S| 36 +- libavcodec/riscv/vvc/sad_rvv.S | 2 +- 2 files changed, 19 insertions(+), 19 deletions(-) diff --git a/libavcodec/riscv/h26x/asm.S b/libavcodec/riscv/h26x/asm.S index d99690d9a0..0c9ebbdaad 100644 --- a