from:"Rémi Denis\-Courmont"

Re: [FFmpeg-devel] [PATCH v4 3/5] avformat/network: add ff_neterror2() for cases where we already have an error

2024-05-16 Thread Rémi Denis-Courmont

Le 16 mai 2024 12:33:36 GMT+03:00, Andrew Sayers a écrit : >For example, WSAStartup()'s documentation says: > >"A call to the WSAGetLastError function is not needed and should not be > used" >--- > libavformat/network.c | 5 - > libavformat/network.h | 12 > 2 files changed

Re: [FFmpeg-devel] [PATCH] checkasm: add sample argument to adjust during bench

2024-05-16 Thread Rémi Denis-Courmont

Le torstaina 16. toukokuuta 2024, 16.09.49 EEST J. Dekker a écrit : > Some timers on certain device and test combinations can produce noisy > results, affecting the reliability of performance measurements. One > notable example of this is the Canaan K230 RISC-V development board. > > An option to

[FFmpeg-devel] [PATCHv4 1/4] lavu/riscv: add assembler macros for adjusting vector LMUL

2024-05-16 Thread Rémi Denis-Courmont

vtype_vli computes the VTYPE value with the optimal LMUL for a given element width, tail and mask policies and a run-time vector length. vtype_ivli does the same, but with the compile-time constant vector length. vwtypei and vntypei can be used to widen or narrow a VTYPE value for use in mixed-wi

[FFmpeg-devel] [PATCHv4 2/4] lavc/flacdsp: optimise RVV vector type for lpc16

2024-05-16 Thread Rémi Denis-Courmont

This calculates the optimal vector type value at run-time based on the hardware vector length and the FLAC LPC prediction order. In this particular case, the additional computation is easily amortised over the loop iterations: T-Head C908: C V before V after 1 48.0214.7 95.2 2

[FFmpeg-devel] [PATCHv4 3/4] lavc/flacdsp: optimise RVV vector type for lpc32

2024-05-16 Thread Rémi Denis-Courmont

This is pretty much the same as for lpc16, though it only improves half as large prediction orders. With 128-bit vectors, this gives: C V old V new 1 69.2 181.5 95.5 2 107.7 180.7 95.2 3 145.5 180.0 103.5 4 183.0 179.2 102.7 5 220.7 178.5 128.0 6 257.7 194.0 127.5 7

[FFmpeg-devel] [PATCHv4 4/4] lavc/huffyuvdsp: optimise RVV vtype for add_hfyu_left_pred_bgr32

2024-05-16 Thread Rémi Denis-Courmont

T-Head C908: add_hfyu_left_pred_bgr32_c: 237.5 add_hfyu_left_pred_bgr32_rvv_i32: 173.5 (before) add_hfyu_left_pred_bgr32_rvv_i32: 110.0 (after) --- libavcodec/riscv/huffyuvdsp_rvv.S | 6 -- 1 file changed, 4 insertions(+), 2 deletions(-) diff --git a/libavcodec/riscv/huffyuvdsp_rvv.S b

Re: [FFmpeg-devel] [PATCHv4 4/4] lavc/huffyuvdsp: optimise RVV vtype for add_hfyu_left_pred_bgr32

2024-05-16 Thread Rémi Denis-Courmont

Le torstaina 16. toukokuuta 2024, 19.48.40 EEST Rémi Denis-Courmont a écrit : > T-Head C908: > add_hfyu_left_pred_bgr32_c: 237.5 > add_hfyu_left_pred_bgr32_rvv_i32: 173.5 (before) > add_hfyu_left_pred_bgr32_rvv_i32: 110.0 (after) > --- > libavcodec/riscv/huffyuvdsp_rvv

Re: [FFmpeg-devel] [PATCH 3/6] lavf/tls_mbedtls: hook up debug message callback

2024-05-17 Thread Rémi Denis-Courmont

Le 17 mai 2024 11:34:35 GMT+03:00, Sfan5 a écrit : >Signed-off-by: sfan5 >--- > libavformat/tls_mbedtls.c | 14 ++ > 1 file changed, 14 insertions(+) > >diff --git a/libavformat/tls_mbedtls.c b/libavformat/tls_mbedtls.c >index 24c3afd94c..9508fe3436 100644 >--- a/libavformat/tls_mbed

Re: [FFmpeg-devel] [RFC] STF 2025

2024-05-17 Thread Rémi Denis-Courmont

Le 17 mai 2024 16:49:58 GMT+03:00, Michael Niedermayer a écrit : >Hi all > >Before this is forgotten again, better start some dicsussion too early than >too late > >I propose that if we have the oppertunity again next year to receive a grant >from STF. That we use it to fund: > >* Paul to work

Re: [FFmpeg-devel] [PATCH v3 5/9] lavc/vp9dsp: R-V V mc avg

2024-05-17 Thread Rémi Denis-Courmont

Le maanantaina 13. toukokuuta 2024, 19.59.22 EEST u...@foxmail.com a écrit : > From: sunyuechi > > C908: > vp9_avg4_8bpp_c: 1.2 > vp9_avg4_8bpp_rvv_i64: 1.0 > vp9_avg8_8bpp_c: 3.7 > vp9_avg8_8bpp_rvv_i64: 1.5 > vp9_avg16_8bpp_c: 14.7 > vp9_avg16_8bpp_rvv_i64: 3.5 > vp9_avg32_8bpp_c: 57.7 > vp9_av

[FFmpeg-devel] [PATCH 1/1] lavc/vc1dsp: R-V V vc1_unescape_buffer

2024-05-17 Thread Rémi Denis-Courmont

vcodec/riscv/vc1dsp_rvv.S +++ b/libavcodec/riscv/vc1dsp_rvv.S @@ -1,5 +1,6 @@ /* * Copyright (c) 2023 Institue of Software Chinese Academy of Sciences (ISCAS). + * Copyright (c) 2024 Rémi Denis-Courmont. * * This file is part of FFmpeg. * @@ -159,3 +160,55 @@ func ff_avg_pixels8x8_rvv,

Re: [FFmpeg-devel] [RFC] STF 2025

2024-05-17 Thread Rémi Denis-Courmont

Le 17 mai 2024 21:50:20 GMT+03:00, Michael Niedermayer a écrit : >Let me reword this for 2024: >The people on the booth are predominantly male. Similarly ffmpeg-devel >is predominantly male. More gender diversity would be good. As you've noted, this is an obvious problem but not an FFmpeg-spec

[FFmpeg-devel] [PATCH 1/2] lavc/startcode: add R-V V startcode_find_candidate

2024-05-18 Thread Rémi Denis-Courmont

tcode_find_candidate_rvv; +# endif #endif } diff --git a/libavcodec/riscv/startcode_rvv.S b/libavcodec/riscv/startcode_rvv.S new file mode 100644 index 00..7c43b1d7f3 --- /dev/null +++ b/libavcodec/riscv/startcode_rvv.S @@ -0,0 +1,44 @@ +/* + * Copyright © 2024 Rémi Denis-Courmont. + * + * R

[FFmpeg-devel] [PATCH 2/2] lavc/vp8dsp: fix .irp use with LLVM as

2024-05-18 Thread Rémi Denis-Courmont

--- libavcodec/riscv/vp8dsp_rvv.S | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/libavcodec/riscv/vp8dsp_rvv.S b/libavcodec/riscv/vp8dsp_rvv.S index 4d7a9f6a2d..0ba9fa443d 100644 --- a/libavcodec/riscv/vp8dsp_rvv.S +++ b/libavcodec/riscv/vp8dsp_rvv.S @@ -171,7 +171,7 @@ en

Re: [FFmpeg-devel] [PATCH 1/2] lavc/startcode: add R-V V startcode_find_candidate

2024-05-18 Thread Rémi Denis-Courmont

Wrong patch sent, disregard -- Rémi Denis-Courmont http://www.remlab.net/ ___ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email ffmpeg-devel-requ

[FFmpeg-devel] [PATCH] lavc/vp9_intra: fix .irp use with LLVM as

2024-05-18 Thread Rémi Denis-Courmont

--- libavcodec/riscv/vp9_intra_rvv.S | 28 ++-- 1 file changed, 14 insertions(+), 14 deletions(-) diff --git a/libavcodec/riscv/vp9_intra_rvv.S b/libavcodec/riscv/vp9_intra_rvv.S index 280c497687..7b3c22b99a 100644 --- a/libavcodec/riscv/vp9_intra_rvv.S +++ b/libavcodec/ri

Re: [FFmpeg-devel] [PATCH v3 6/9] lavc/vp9dsp: R-V V mc bilin h v

2024-05-18 Thread Rémi Denis-Courmont

Le maanantaina 13. toukokuuta 2024, 19.59.23 EEST u...@foxmail.com a écrit : > From: sunyuechi > > C908: > vp9_avg_bilin_4h_8bpp_c: 5.2 > vp9_avg_bilin_4h_8bpp_rvv_i64: 2.2 > vp9_avg_bilin_4v_8bpp_c: 5.5 > vp9_avg_bilin_4v_8bpp_rvv_i64: 2.2 > vp9_avg_bilin_8h_8bpp_c: 20.0 > vp9_avg_bilin_8h_8bpp_

[FFmpeg-devel] [PATCH] checkasm: add h263dsp.{h,v}_loop_filter

2024-05-18 Thread Rémi Denis-Courmont

_check_h264chroma(void); void checkasm_check_h264dsp(void); void checkasm_check_h264pred(void); diff --git a/tests/checkasm/h263dsp.c b/tests/checkasm/h263dsp.c new file mode 100644 index 00..2d0957a90b --- /dev/null +++ b/tests/checkasm/h263dsp.c @@ -0,0 +1,62 @@ +/* + * Copyright (c) 2024 Ré

Re: [FFmpeg-devel] [PATCH v3 2/2] avcodec: add external dec libvvdec for H266/VVC

2024-05-18 Thread Rémi Denis-Courmont

Hi, Le 18 mai 2024 21:55:04 GMT+03:00, Cosmin Stejerean via ffmpeg-devel a écrit : > > >> On May 18, 2024, at 7:04 AM, Nuo Mi wrote: >> >> This happened many years ago. See the discussion here: >> https://patchwork.ffmpeg.org/project/ffmpeg/patch/20201221060710.12230-6-nuomi2...@gmail.com/#605

Re: [FFmpeg-devel] [PATCH] tests/ref/fate/source: Add exceptions for riscv startcode files

2024-05-19 Thread Rémi Denis-Courmont

Le sunnuntaina 19. toukokuuta 2024, 11.49.52 EEST Andreas Rheinhardt a écrit : > Fixes fate-source. > > Signed-off-by: Andreas Rheinhardt > --- > An alternative would be to change the license header, but this would > have to be done by courmisch himself. A better alternative would be to fix the

[FFmpeg-devel] [PATCH] lavc/h263dsp: R-V V {h,v}_loop_filter

2024-05-19 Thread Rémi Denis-Courmont

git a/libavcodec/riscv/h263dsp_init.c b/libavcodec/riscv/h263dsp_init.c new file mode 100644 index 00..21b536366c --- /dev/null +++ b/libavcodec/riscv/h263dsp_init.c @@ -0,0 +1,41 @@ +/* + * Copyright © 2022 Rémi Denis-Courmont. + * + * This file is part of FFmpeg. + * + * FFmpeg is free soft

Re: [FFmpeg-devel] [PATCH] avutil/hwcontext_drm: Check ioctl in drm_map_frame() for failure

2024-05-20 Thread Rémi Denis-Courmont

Le 20 mai 2024 12:33:41 GMT+03:00, Andreas Rheinhardt a écrit : >Michael Niedermayer: >> Fixes: CID1583742 Unchecked return value >> >> Sponsored-by: Sovereign Tech Fund >> Signed-off-by: Michael Niedermayer >> --- >> libavutil/hwcontext_drm.c | 5 - >> 1 file changed, 4 insertions(+), 1

Re: [FFmpeg-devel] [PATCH v3 2/2] avcodec: add external dec libvvdec for H266/VVC

2024-05-20 Thread Rémi Denis-Courmont

Le maanantaina 20. toukokuuta 2024, 19.33.43 EEST Cosmin Stejerean via ffmpeg- devel a écrit : > > Adding a disabled-by-default decoder is adding bloat, if there is not a > > specific known reason why that is needed. > > This is especially true for video decoders, where the external decoders > >

Re: [FFmpeg-devel] [RFC] STF 2025

2024-05-20 Thread Rémi Denis-Courmont

Le sunnuntaina 19. toukokuuta 2024, 14.29.43 EEST Thilo Borgmann via ffmpeg- devel a écrit : > [...] > > >> * Fund administrative / maintainance work (one example is the mailman > >> upgrade that is needed>> > >> with the next OS upgrade on one of our servers (this is not as trivial > >> as o

Re: [FFmpeg-devel] [PATCH v3 2/2] avcodec: add external dec libvvdec for H266/VVC

2024-05-20 Thread Rémi Denis-Courmont

Le maanantaina 20. toukokuuta 2024, 21.39.18 EEST Cosmin Stejerean via ffmpeg- devel a écrit : > The same way using FDK-AAC as a decoder works, if you want to use it as the > *AAC decoder you have to specify the decoder with -c:a libfdk_aac before > the input.- AFAIK, we don't have hwaccel for aud

Re: [FFmpeg-devel] [PATCH v3 2/2] avcodec: add external dec libvvdec for H266/VVC

2024-05-20 Thread Rémi Denis-Courmont

Le maanantaina 20. toukokuuta 2024, 22.33.45 EEST Cosmin Stejerean via ffmpeg- devel a écrit : > > And again, you can't expect users to select decoders manually. If vvdec is > > the > > default, hwaccel won't work. If vvdec is not the default, then it's > > dead code. > > Not sure why you keep re

Re: [FFmpeg-devel] [PATCH v4 2/2][GSoC 2024] tests/checkasm: Add check_vvc_sad to vvc_mc.c

2024-05-20 Thread Rémi Denis-Courmont

Hi, Le 20 mai 2024 03:42:03 GMT+03:00, Stone Chen a écrit : >Adds checkasm for DMVR SAD AVX2 implementation. > >Benchmarks ( AMD 7940HS ) >vvc_sad_8x8_c: 70.0 >vvc_sad_8x8_avx2: 10.0 >vvc_sad_16x16_c: 280.0 >vvc_sad_16x16_avx2: 20.0 >vvc_sad_32x32_c: 1020.0 >vvc_sad_32x32_avx2: 70.0 >vvc_sad_64x

Re: [FFmpeg-devel] [PATCH v4 2/2][GSoC 2024] tests/checkasm: Add check_vvc_sad to vvc_mc.c

2024-05-21 Thread Rémi Denis-Courmont

Le 21 mai 2024 09:37:18 GMT+03:00, "Martin Storsjö" a écrit : >On Tue, 21 May 2024, Rémi Denis-Courmont wrote: > >> Hi, >> >> Le 20 mai 2024 03:42:03 GMT+03:00, Stone Chen a >> écrit : >>> Adds checkasm for DMVR SAD AVX2 implementation. >

Re: [FFmpeg-devel] [PATCH] checkasm: vvc_alf: Limit benchmarking to a reasonable subset of functions

2024-05-21 Thread Rémi Denis-Courmont

Le 21 mai 2024 13:04:29 GMT+03:00, "Martin Storsjö" a écrit : >Don't benchmark every single combination of widths and heights; >only benchmark cases which are squares (like in vvc_mc.c). > >Contrary to vvc_mc, which increases sizes by doubling dimensions, >vvc_alf tests all sizes in increments o

Re: [FFmpeg-devel] [PATCH] checkasm: h264dsp: Avoid out of buffer writes when benchmarking

2024-05-21 Thread Rémi Denis-Courmont

Le 21 mai 2024 14:04:53 GMT+03:00, "Martin Storsjö" a écrit : >The loop filters can write before the pointer given to them; >the actual test invocations correctly used an offset, while >the benchmark calls were lacking an offset. Therefore, when >running with benchmarking, these tests could have

Re: [FFmpeg-devel] [PATCH 3/6] lavf/tls_mbedtls: hook up debug message callback

2024-05-21 Thread Rémi Denis-Courmont

Le 21 mai 2024 13:14:59 GMT+03:00, sfan5 a écrit : >Am 17.05.24 um 11:51 schrieb Rémi Denis-Courmont: >> Le 17 mai 2024 11:34:35 GMT+03:00, Sfan5 a écrit : >>> Signed-off-by: sfan5 >>> --- >>> libavformat/tls_mbedtls.c | 14 ++ >>> 1

Re: [FFmpeg-devel] [PATCH v4 1/5] lavc/vp9dsp: R-V V mc avg

2024-05-21 Thread Rémi Denis-Courmont

\ > +dsp->mc[idx1][FILTER_8TAP_SMOOTH ][1][0][0] = ff_avg##sz##_rvv; \ > +dsp->mc[idx1][FILTER_8TAP_REGULAR][1][0][0] = ff_avg##sz##_rvv; \ > +dsp->mc[idx1][FILTER_8TAP_SHARP ][1][0][0] = ff_avg##sz##_rvv; \ > +dsp->mc[idx1][FILTER_B

Re: [FFmpeg-devel] [PATCH v4 1/5] lavc/vp9dsp: R-V V mc avg

2024-05-21 Thread Rémi Denis-Courmont

his. All non-static symbols in libavcodec are in the same namespace. > Many macros in the C file are copied from MIPS, where > it is called ff_avg4_msa. Here, it has been simply changed to ff_avg4_rvv. AArch64 uses ff_vp9_avg which seems a lot saner. -- Rémi Den

Re: [FFmpeg-devel] [PATCH] lavc/vvc_mc: R-V V avg w_avg

2024-05-21 Thread Rémi Denis-Courmont

t_riscv(VVCDSPContext *const c, const int bd) > +{ > +#if HAVE_RVV > +const int flags = av_get_cpu_flags(); > + > +if ((flags & AV_CPU_FLAG_RVV_I32) && (flags & AV_CPU_FLAG_RVB_ADDR) && > +ff_rv_vlen_least(256)) { > +switch (bd) { >

[FFmpeg-devel] [PATCH 2/2] lavc/pixblockdsp: add scalar get_pixels_unaligned

2024-05-21 Thread Rémi Denis-Courmont

The code is already there, we just need to use it. get_pixels_unaligned_c: 2.2 get_pixels_unaligned_misaligned: 1.7 --- libavcodec/riscv/pixblockdsp_init.c | 7 +++ 1 file changed, 7 insertions(+) diff --git a/libavcodec/riscv/pixblockdsp_init.c b/libavcodec/riscv/pixblockdsp_init.c index b

[FFmpeg-devel] [PATCH 1/2] checkasm/riscv: test misaligned before V

2024-05-21 Thread Rémi Denis-Courmont

Otherwise V functions mask scalar misaligned ones. --- tests/checkasm/checkasm.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/tests/checkasm/checkasm.c b/tests/checkasm/checkasm.c index 31ca9f6e2b..76835ab267 100644 --- a/tests/checkasm/checkasm.c +++ b/tests/checkasm/checka

Re: [FFmpeg-devel] [PATCH v2 2/5] lavc/vp9dsp: R-V V mc bilin h v

2024-05-21 Thread Rémi Denis-Courmont

vv.S > @@ -53,6 +53,49 @@ func ff_vp9_avg\len\()_rvv, zve32x > endfunc > .endm > > +.macro bilin_load dst len op type mn Commas, please. -- Rémi Denis-Courmont http://www.remlab.net/ ___ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.

Re: [FFmpeg-devel] [RFC] STF 2025

2024-05-21 Thread Rémi Denis-Courmont

Le tiistaina 21. toukokuuta 2024, 21.43.44 EEST Thilo Borgmann via ffmpeg-devel a écrit : > >> Same as above about that we should and STF would. > >> Especially since no corporate interest usually pays anyone for these > >> tasks > > > > Sadly true, but... > > > >> (in case of reviews it might o

Re: [FFmpeg-devel] [RFC] STF 2025

2024-05-21 Thread Rémi Denis-Courmont

Le tiistaina 21. toukokuuta 2024, 22.42.00 EEST Rémi Denis-Courmont a écrit : > And "I hope you realise that you are arguing for" Intel, Loongson, etc. > employees to stop reviewing patches. P.S.: And FFlabs too, since it is a for-profit company. -- レミ・デニ-クールモン http:

Re: [FFmpeg-devel] [RFC] STF 2025

2024-05-22 Thread Rémi Denis-Courmont

Le 22 mai 2024 00:34:03 GMT+03:00, Thilo Borgmann via ffmpeg-devel a écrit : >>> I hope you realize what you argue in favor of. >> >> Yes. It's quoted above. >> >> Are you claiming that *no* review is better than *some* review done in >> *public* for all to see by a paid professional just bec

Re: [FFmpeg-devel] [RFC PATCH 2/2] tests/fate/source-check: Relax BSD licence check

2024-05-22 Thread Rémi Denis-Courmont

Le 22 mai 2024 14:04:50 GMT+03:00, Andreas Rheinhardt a écrit : >Several files already had standard license header (namely >2-clause BSD files), yet due to the 80 char line length limit, >they were not treated as such by source-check.sh (which >fate-source uses). Therefore relax the BSD check.

[FFmpeg-devel] [PATCH] lavc/rv34dsp: optimise R-V V idct_dc_add

2024-05-22 Thread Rémi Denis-Courmont

This removes one stray LI and reworks the vector arithmetic to avoid changing the vector configuration. On K230, this takes the 46.5 cycle count down from 46.5 to 43.5. --- libavcodec/riscv/rv34dsp_rvv.S | 13 ++--- 1 file changed, 6 insertions(+), 7 deletions(-) diff --git a/libavcodec/r

Re: [FFmpeg-devel] [PATCH] lavc/rv34dsp: optimise R-V V idct_dc_add

2024-05-22 Thread Rémi Denis-Courmont

Le 22 mai 2024 23:28:54 GMT+03:00, "Rémi Denis-Courmont" a écrit : >This removes one stray LI and reworks the vector arithmetic to avoid >changing the vector configuration. On K230, this takes the 46.5 cycle >count down from 46.5 to 43.5. >--- > libavcodec

Re: [FFmpeg-devel] [PATCH] lavc/rv34dsp: optimise R-V V idct_dc_add

2024-05-23 Thread Rémi Denis-Courmont

there :-( FWIW, beware that people will tend to assume that you are a professional if you assign copyright to a legal entity, even if there is another nonobvious reason why you do that. >Rémi Denis-Courmont 于2024年5月23日周四 04:35写道： > >> >> >> Le 22 mai 2024 23:28:54 GMT+0

[FFmpeg-devel] [PATCH] lavc/rv32dsp: remove stray load immediate

2024-05-23 Thread Rémi Denis-Courmont

--- libavcodec/riscv/rv34dsp_rvv.S | 1 - 1 file changed, 1 deletion(-) diff --git a/libavcodec/riscv/rv34dsp_rvv.S b/libavcodec/riscv/rv34dsp_rvv.S index f1f6345012..8eda01665d 100644 --- a/libavcodec/riscv/rv34dsp_rvv.S +++ b/libavcodec/riscv/rv34dsp_rvv.S @@ -37,7 +37,6 @@ func ff_rv34_idct_dc

[FFmpeg-devel] [PATCH] lavc/vp8dsp: remove unused macro parameter

2024-05-23 Thread Rémi Denis-Courmont

--- libavcodec/riscv/vp8dsp_rvv.S | 8 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/libavcodec/riscv/vp8dsp_rvv.S b/libavcodec/riscv/vp8dsp_rvv.S index 0ba9fa443d..9c84b1503e 100644 --- a/libavcodec/riscv/vp8dsp_rvv.S +++ b/libavcodec/riscv/vp8dsp_rvv.S @@ -94,7 +94,7 @@

Re: [FFmpeg-devel] [RFC] STF 2025

2024-05-24 Thread Rémi Denis-Courmont

Le 24 mai 2024 10:11:49 GMT+03:00, Thilo Borgmann via ffmpeg-devel a écrit : >Unfortunately true, yet you argue to pay more companies to do reviews instead >having reviews funded by unbiased means. That is a blatant and libelous lie and will be reported to CC promptly. ___

Re: [FFmpeg-devel] [PATCH] lavc/rv32dsp: remove stray load immediate

2024-05-24 Thread Rémi Denis-Courmont

Le 24 mai 2024 09:44:50 GMT+03:00, Andreas Rheinhardt a écrit : >Rémi Denis-Courmont: >> --- >> libavcodec/riscv/rv34dsp_rvv.S | 1 - >> 1 file changed, 1 deletion(-) >> >> diff --git a/libavcodec/riscv/rv34dsp_rvv.S b/libavcodec/riscv/rv34dsp_rvv.S >&

[FFmpeg-devel] [PATCH 1/2] checkasm/vp8dsp: share DSP context across tests

2024-05-24 Thread Rémi Denis-Courmont

This will simplify later changes. --- tests/checkasm/vp8dsp.c | 82 + 1 file changed, 34 insertions(+), 48 deletions(-) diff --git a/tests/checkasm/vp8dsp.c b/tests/checkasm/vp8dsp.c index 6e989819ef..12d2f62f2d 100644 --- a/tests/checkasm/vp8dsp.c +++ b/te

[FFmpeg-devel] [PATCH 2/2] checkasm/vp8dsp: add VP7 tests

2024-05-24 Thread Rémi Denis-Courmont

--- tests/checkasm/vp8dsp.c | 57 ++--- 1 file changed, 36 insertions(+), 21 deletions(-) diff --git a/tests/checkasm/vp8dsp.c b/tests/checkasm/vp8dsp.c index 12d2f62f2d..e448fe89fc 100644 --- a/tests/checkasm/vp8dsp.c +++ b/tests/checkasm/vp8dsp.c @@ -18,8 +18

[FFmpeg-devel] [PATCH] lavc/flacdsp: do not assume maximum R-V VL

2024-05-24 Thread Rémi Denis-Courmont

This loop correctly assumes that VLMAX=16 (4x128-bit vectors with 32-bit elements) and 32 >= pred_order > 16. We need to alternate between VL=16 and VL=t2=pred_order-16 elements to add up to pred_order. The current code requests AVL=a2=pred_order elements. In QEMU and on thte K230 hardware, this s

[FFmpeg-devel] [PATCH] WIP: lavc/flacdsp: R-V Zvl256b lpc33

2024-05-24 Thread Rémi Denis-Courmont

flac_lpc_33_13_c: 499.7 flac_lpc_33_13_rvv_i64: 197.7 flac_lpc_33_16_c: 601.5 flac_lpc_33_16_rvv_i64: 195.2 flac_lpc_33_29_c: 1011.5 flac_lpc_33_29_rvv_i64: 300.7 flac_lpc_33_32_c: 1099.0 flac_lpc_33_32_rvv_i64: 296.7 --- libavcodec/riscv/flacdsp_init.c | 7 +-- libavcodec/riscv/flacdsp_rvv.S

Re: [FFmpeg-devel] [PATCH] WIP: lavc/flacdsp: R-V Zvl256b lpc33

2024-05-24 Thread Rémi Denis-Courmont

Le perjantaina 24. toukokuuta 2024, 23.04.40 EEST Rémi Denis-Courmont a écrit : > flac_lpc_33_13_c: 499.7 > flac_lpc_33_13_rvv_i64: 197.7 > flac_lpc_33_16_c: 601.5 > flac_lpc_33_16_rvv_i64: 195.2 > flac_lpc_33_29_c: 1011.5 > flac_lpc_33_29_rvv_i64: 300.7 > fla

Re: [FFmpeg-devel] [PATCH] lavc/vvc_mc: R-V V avg w_avg

2024-05-25 Thread Rémi Denis-Courmont

Hi, Sorry for the delay, this is not my day job. Le tiistaina 21. toukokuuta 2024, 22.24.21 EEST u...@foxmail.com a écrit : > From: sunyuechi > --- > libavcodec/riscv/Makefile | 2 + > libavcodec/riscv/vvc_mc_rvv.S | 313 + > libavcodec/riscv/vvcd

Re: [FFmpeg-devel] [PATCH v2 3/5] lavc/vp9dsp: R-V V mc tap h v

2024-05-25 Thread Rémi Denis-Courmont

nit_fpel > + > +#define init_subpel1(idx1, idx2, idxh, idxv, sz, dir, type, vlen) \ > +dsp->mc[idx1][FILTER_8TAP_SMOOTH ][idx2][idxh][idxv] = \ > +ff_##type##_vp9_8tap_smooth_##sz##dir##_rvv##vlen; \ > +dsp->mc[idx1][FILTER_8TAP_REGULAR][idx2][idxh][idxv] = \ > +

Re: [FFmpeg-devel] [PATCH 1/3] avformat/file: check for lseek() failure

2024-05-25 Thread Rémi Denis-Courmont

Le lauantaina 25. toukokuuta 2024, 14.18.12 EEST Michael Niedermayer a écrit : > Fixes: CID1559855 Unchecked return value from library This is again very misleading. Logging an error (that most people won't even see) is not fixing a bug. Either this is not a bug, and this patch can't be fixing a

[FFmpeg-devel] [PATCH] checkasm/lpc: test compute_autocorr

2024-05-25 Thread Rémi Denis-Courmont

-- Change since previous version: - Disable test on x86, since the x86 optimisation fails with a suspiciously large and regular error with 10-20% probability. --- tests/checkasm/lpc.c | 54 ++-- 1 file changed, 52 insertions(+), 2 deletions(-) diff --git

Re: [FFmpeg-devel] [PATCH 1/3] avformat/file: check for lseek() failure

2024-05-25 Thread Rémi Denis-Courmont

Le lauantaina 25. toukokuuta 2024, 15.19.51 EEST Michael Niedermayer a écrit : > On Sat, May 25, 2024 at 02:38:07PM +0300, Rémi Denis-Courmont wrote: > > Le lauantaina 25. toukokuuta 2024, 14.18.12 EEST Michael Niedermayer a écrit : > > > Fixes: CID1559855 Unchecked return

[FFmpeg-devel] [PATCH 2/5] lavc/vp8dsp: expand single use R-V macros

2024-05-25 Thread Rémi Denis-Courmont

--- libavcodec/riscv/vp8dsp_rvv.S | 24 ++-- 1 file changed, 6 insertions(+), 18 deletions(-) diff --git a/libavcodec/riscv/vp8dsp_rvv.S b/libavcodec/riscv/vp8dsp_rvv.S index cb9b0b8b5f..bb0c7bf02a 100644 --- a/libavcodec/riscv/vp8dsp_rvv.S +++ b/libavcodec/riscv/vp8dsp_rvv.S

[FFmpeg-devel] [PATCH 1/5] lavc/vp8dsp: avoid one multiplication on RISC-V

2024-05-25 Thread Rémi Denis-Courmont

Use shifts rather than multiply, and save one instruction. --- libavcodec/riscv/vp8dsp_init.c | 26 ++ libavcodec/riscv/vp8dsp_rvv.S | 7 +++ 2 files changed, 17 insertions(+), 16 deletions(-) diff --git a/libavcodec/riscv/vp8dsp_init.c b/libavcodec/riscv/vp8dsp_init

[FFmpeg-devel] [PATCH 4/5] lavc/vp8dsp: save one R-V GPR

2024-05-25 Thread Rémi Denis-Courmont

This saves one instruction and frees up A5, which will be repurposed in later changes. Unfortunately, we need to add quite a lot of alternative code for this. --- libavcodec/riscv/vp8dsp_rvv.S | 24 1 file changed, 16 insertions(+), 8 deletions(-) diff --git a/libavcodec/

[FFmpeg-devel] [PATCH 3/5] lavc/vp8dsp: factor R-V V bilin functions

2024-05-25 Thread Rémi Denis-Courmont

For a given type, only the first VSETVLI instruction varies depending on the size. --- libavcodec/riscv/vp8dsp_rvv.S | 37 +-- 1 file changed, 27 insertions(+), 10 deletions(-) diff --git a/libavcodec/riscv/vp8dsp_rvv.S b/libavcodec/riscv/vp8dsp_rvv.S index bb0c7bf

[FFmpeg-devel] [PATCH 5/5] lavc/vp8dsp: factor R-V V EPEL functions for all lengths

2024-05-25 Thread Rémi Denis-Courmont

--- libavcodec/riscv/vp8dsp_rvv.S | 56 --- 1 file changed, 32 insertions(+), 24 deletions(-) diff --git a/libavcodec/riscv/vp8dsp_rvv.S b/libavcodec/riscv/vp8dsp_rvv.S index a4fcd158a5..002e7f3174 100644 --- a/libavcodec/riscv/vp8dsp_rvv.S +++ b/libavcodec/riscv/v

Re: [FFmpeg-devel] [PATCH] checkasm: add h263dsp.{h,v}_loop_filter

2024-05-25 Thread Rémi Denis-Courmont

Le lauantaina 18. toukokuuta 2024, 21.44.10 EEST Rémi Denis-Courmont a écrit : > --- > tests/checkasm/Makefile | 1 + > tests/checkasm/checkasm.c | 3 ++ > tests/checkasm/checkasm.h | 1 + > tests/checkasm/h263dsp.c | 62 +++ > tests

Re: [FFmpeg-devel] [PATCH v2 3/5] lavc/vp9dsp: R-V V mc tap h v

2024-05-25 Thread Rémi Denis-Courmont

C uses 16-bit values for no apparent reasons. We should probably change the C code to use 8- bit on all platforms - except AArch64, because the NEON code probably relies on 16-bit format. -- Rémi Denis-Courmont http://www.remlab.net/ ___ ffmpeg

[FFmpeg-devel] [PATCH] lavc/sbrdsp: fold immediate offset into relocation

2024-05-25 Thread Rémi Denis-Courmont

This results in AUIPC; ADDI instead of AUIPC; ADDI; ... ADDI. --- libavcodec/riscv/sbrdsp_rvv.S | 3 +-- 1 file changed, 1 insertion(+), 2 deletions(-) diff --git a/libavcodec/riscv/sbrdsp_rvv.S b/libavcodec/riscv/sbrdsp_rvv.S index aba9a28108..331b88022c 100644 --- a/libavcodec/riscv/sbrdsp_rvv.

[FFmpeg-devel] [PATCH] lavc/sbrdsp: fix inverted boundary check

2024-05-25 Thread Rémi Denis-Courmont

128-it is the maximum, not the minimum here. Larger vector sizes can result in reads past the end of the noise value table. This partially reverts commit cdcb4b98b7f74d87a6274899ff70724795d551cb. --- libavcodec/riscv/sbrdsp_init.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git

Re: [FFmpeg-devel] [PATCH] lavc/sbrdsp: fix inverted boundary check

2024-05-25 Thread Rémi Denis-Courmont

Le lauantaina 25. toukokuuta 2024, 20.52.44 EEST Rémi Denis-Courmont a écrit : > 128-it is the maximum, not the minimum here. Larger vector sizes can > result in reads past the end of the noise value table. Nit: bit > > This partially reverts commit cdcb4b98b7f74d87a6274899ff70

Re: [FFmpeg-devel] [PATCH 5/5] lavc/vp8dsp: factor R-V V EPEL functions for all lengths

2024-05-25 Thread Rémi Denis-Courmont

Le lauantaina 25. toukokuuta 2024, 21.16.22 EEST flow gg a écrit : > Would it be better to replace the two vsetvlstatic8 and vsetvlstatic16 with > two vsetvl? The other option is to hard-code the most pessimistic multiplier. That would be easier to read and save two instructions in the head, it w

[FFmpeg-devel] [PATCH] lavc/sbrdsp: fix potential overflow in noise table

2024-05-25 Thread Rémi Denis-Courmont

Since the SBR noise application optimisations are currently restricted to hardware with 128-bit vectors, and use a quadruple multipler, they can load up to 16 32-bit elements. But the "loads" are of 2 segments, or 16 pairs of single precision float. Thus we need to expand the dupiclated section of

[FFmpeg-devel] [PATCH] lavc/sbrdsp: add support for 256-bit vectors

2024-05-25 Thread Rémi Denis-Courmont

hf_apply_noise_0_c: 35.7 hf_apply_noise_0_rvv_f32: 9.5 hf_apply_noise_1_c: 38.5 hf_apply_noise_1_rvv_f32: 10.0 hf_apply_noise_2_c: 35.5 hf_apply_noise_2_rvv_f32: 9.7 hf_apply_noise_3_c: 38.5 hf_apply_noise_3_rvv_f32: 10.0 Maybe extending the noise table manually is not such great idea, but I not q

Re: [FFmpeg-devel] [PATCH 5/5] lavc/vp8dsp: factor R-V V EPEL functions for all lengths

2024-05-25 Thread Rémi Denis-Courmont

Le lauantaina 25. toukokuuta 2024, 21.56.58 EEST flow gg a écrit : > The reduction in code size seems to be due to switching to using j labels, > doesn't seem to be about vset, but another issue. j labels are indeed > better. I will make similar modifications. If we don't use vsetvl, then we have

[FFmpeg-devel] [PATCH] lavc/lpc: optimise RVV vector type for compute_autocorr

2024-05-25 Thread Rémi Denis-Courmont

On SpacemiT X60 (with len == 4000): autocorr_10_c: 2303.7 autocorr_10_rvv_f64: 1411.5 (before) autocorr_10_rvv_f64: 842.2 (after) --- libavcodec/riscv/lpc_init.c | 3 ++- libavcodec/riscv/lpc_rvv.S | 5 +++-- 2 files changed, 5 insertions(+), 3 deletions(-) diff --git a/libavcodec/riscv/l

Re: [FFmpeg-devel] [PATCH] lpc: rewrite lpc_compute_autocorr in external asm

2024-05-25 Thread Rémi Denis-Courmont

Le sunnuntaina 26. toukokuuta 2024, 1.31.18 EEST James Almer a écrit : > On 5/25/2024 5:57 PM, Lynne via ffmpeg-devel wrote: > > The inline asm function had issues running under checkasm. > > So I came to finish what I started, and wrote the last part > > of LPC computation in assembly. > > > > au

[FFmpeg-devel] [PATCH] lavc/vp8dsp: R-V V vp8_luma_dc_wht

2024-05-26 Thread Rémi Denis-Courmont

This is not great as transposition is poorly supported, but it works: vp8_luma_dc_wht_c: 2.5 vp8_luma_dc_wht_rvv_i32: 1.7 --- libavcodec/riscv/vp8dsp_init.c | 2 ++ libavcodec/riscv/vp8dsp_rvv.S | 55 ++ 2 files changed, 57 insertions(+) diff --git a/libavc

Re: [FFmpeg-devel] [PATCH] lavc/vp8dsp: R-V V vp8_luma_dc_wht

2024-05-26 Thread Rémi Denis-Courmont

Le sunnuntaina 26. toukokuuta 2024, 10.29.02 EEST Rémi Denis-Courmont a écrit : > diff --git a/libavcodec/riscv/vp8dsp_init.c b/libavcodec/riscv/vp8dsp_init.c > index 2413fbf449..d48fe08560 100644 > --- a/libavcodec/riscv/vp8dsp_init.c > +++ b/libavcodec/riscv/vp8dsp_init.c >

[FFmpeg-devel] [PATCH] lavc/vp7dsp: add R-V V vp7_luma_dc_wht

2024-05-26 Thread Rémi Denis-Courmont

-OBJS-$(CONFIG_VP8DSP) += riscv/vp8dsp_rvv.o diff --git a/libavcodec/riscv/vp7dsp_init.c b/libavcodec/riscv/vp7dsp_init.c new file mode 100644 index 00..6d9aba43d9 --- /dev/null +++ b/libavcodec/riscv/vp7dsp_init.c @@ -0,0 +1,41 @@ +/* + * Copyright (c) 2024 Rémi Denis-Courmont. + * + * This

[FFmpeg-devel] [PATCH 2/3] lavc/vp7dsp: revector ff_vp7_dc_wht_rvv

2024-05-26 Thread Rémi Denis-Courmont

This prepares for some code reuse. --- libavcodec/riscv/vp7dsp_init.c | 3 ++- libavcodec/riscv/vp7dsp_rvv.S | 23 ++- 2 files changed, 16 insertions(+), 10 deletions(-) diff --git a/libavcodec/riscv/vp7dsp_init.c b/libavcodec/riscv/vp7dsp_init.c index 5a57a39d0b..1406eb1c6a

[FFmpeg-devel] [PATCH 3/3] lavc/vp7dsp: R-V V vp7_idct_add

2024-05-26 Thread Rémi Denis-Courmont

Most of the code is shared with DC, thanks to minor earlier changes. vp7_idct_add_c: 5.2 vp7_idct_add_rvv_i32: 2.5 --- libavcodec/riscv/vp7dsp_init.c | 2 ++ libavcodec/riscv/vp7dsp_rvv.S | 29 + 2 files changed, 31 insertions(+) diff --git a/libavcodec/riscv/

[FFmpeg-devel] [PATCH] lavc/vp8dsp: disable EPEL HV on RV128

2024-05-26 Thread Rémi Denis-Courmont

RV128 is mostly scifi at this point, so we can just disable it here (the EPEL HV prologue/epilogue do not save 128-bit registers). --- libavcodec/riscv/vp8dsp_init.c | 3 ++- libavcodec/riscv/vp8dsp_rvv.S | 2 ++ 2 files changed, 4 insertions(+), 1 deletion(-) diff --git a/libavcodec/riscv/vp8ds

Re: [FFmpeg-devel] [PATCH] checkasm: add h263dsp.{h,v}_loop_filter

2024-05-26 Thread Rémi Denis-Courmont

Le lauantaina 25. toukokuuta 2024, 19.30.44 EEST James Almer a écrit : > On 5/18/2024 3:44 PM, Rémi Denis-Courmont wrote: > > --- > > > > tests/checkasm/Makefile | 1 + > > tests/checkasm/checkasm.c | 3 ++ > > tests/checkasm/checkasm.h | 1 + >

Re: [FFmpeg-devel] [PATCH v2 1/5] lavc/vp9dsp: R-V V mc avg

2024-05-26 Thread Rémi Denis-Courmont

atches. You can revert the patch and then provide a better version in a patch series. However it may be prefereable to squash the revert and the improved version (and describe the improvements in the commit log). -- Rémi Denis-Courmont http://ww

Re: [FFmpeg-devel] [PATCH v2 2/5] lavc/vp9dsp: R-V V mc bilin h v

2024-05-26 Thread Rémi Denis-Courmont

Le tiistaina 21. toukokuuta 2024, 21.36.33 EEST flow gg a écrit : > Do macros definition also need a comma? Indeed not. I'd rather be safe than sorry though. -- レミ・デニ-クールモン http://www.remlab.net/ ___ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org ht

Re: [FFmpeg-devel] [PATCH] checkasm/lpc: test compute_autocorr

2024-05-26 Thread Rémi Denis-Courmont

Le lauantaina 25. toukokuuta 2024, 14.58.53 EEST Rémi Denis-Courmont a écrit : > -- > Change since previous version: > - Disable test on x86, since the x86 optimisation fails > with a suspiciously large and regular error with 10-20% probability. > --- > tests/c

[FFmpeg-devel] [PATCH] Revert "avformat/file: seek back to initial position for fd protocol"

2024-05-27 Thread Rémi Denis-Courmont

The generally expected behaviour is for software to append from the current position, not to keep resetting. See also how shell works: # (foo && bar) > file ...would normally concatenate the output of both commands to file. If foo seeks like lavf, bar would instead overwrite the output of foo (po

[FFmpeg-devel] [PATCH 1/2] lavc/lpc: fix off-by-one in R-V V compute_autocorr

2024-05-27 Thread Rémi Denis-Courmont

--- libavcodec/riscv/lpc_init.c | 2 +- libavcodec/riscv/lpc_rvv.S | 1 + 2 files changed, 2 insertions(+), 1 deletion(-) diff --git a/libavcodec/riscv/lpc_init.c b/libavcodec/riscv/lpc_init.c index f21eca4caa..d9bcbbe604 100644 --- a/libavcodec/riscv/lpc_init.c +++ b/libavcodec/riscv/lpc_init.c

[FFmpeg-devel] [PATCH 2/2] riscv: allow passing addend to vtype_vli macro

2024-05-27 Thread Rémi Denis-Courmont

A constant (-1) is added to the length value, so we can have an added for free, and optimise the addition away if the addend is exactly 1. --- libavcodec/riscv/lpc_rvv.S | 2 +- libavutil/riscv/asm.S | 9 ++--- 2 files changed, 7 insertions(+), 4 deletions(-) diff --git a/libavcodec/risc

[FFmpeg-devel] [PATCHv3] checkasm/lpc: test compute_autocorr

2024-05-27 Thread Rémi Denis-Courmont

--- Changes since v2: - Scale the error factor to length since this computes sums. - Check the last element from results. - Use fixed vector size for benchmarks. --- tests/checkasm/lpc.c | 51 +--- 1 file changed, 48 insertions(+), 3 deletions(-) diff --gi

Re: [FFmpeg-devel] [PATCHv3] checkasm/lpc: test compute_autocorr

2024-05-27 Thread Rémi Denis-Courmont

Le maanantaina 27. toukokuuta 2024, 19.01.59 EEST Rémi Denis-Courmont a écrit : > --- > Changes since v2: > - Scale the error factor to length since this computes sums. > - Check the last element from results. > - Use fixed vector size for benchmarks. - Test real use lag values (f

[FFmpeg-devel] [PATCH] lavc/startcode: fix RVV return value on no match

2024-05-27 Thread Rémi Denis-Courmont

If there are no zero bytes, t2 equals -1. The code cannot simply fall through to the match case. --- libavcodec/riscv/startcode_rvv.S | 2 ++ 1 file changed, 2 insertions(+) diff --git a/libavcodec/riscv/startcode_rvv.S b/libavcodec/riscv/startcode_rvv.S index 7c43b1d7f3..36a3369431 100644 --- a/

Re: [FFmpeg-devel] [PATCHv3] checkasm/lpc: test compute_autocorr

2024-05-27 Thread Rémi Denis-Courmont

Le maanantaina 27. toukokuuta 2024, 22.15.40 EEST James Almer a écrit : > On 5/27/2024 4:10 PM, James Almer wrote: > > On 5/27/2024 1:01 PM, Rémi Denis-Courmont wrote: > >> --- > >> Changes since v2: > >> - Scale the error factor to length since this computes

Re: [FFmpeg-devel] [PATCH 5/7] avcodec/vc2enc: Avoid relocations for short strings

2024-05-27 Thread Rémi Denis-Courmont

Le 28 mai 2024 06:07:41 GMT+03:00, James Almer a écrit : >On 5/27/2024 11:49 PM, Andreas Rheinhardt wrote: >> These strings are so short that they can be put directly >> into the containing structure, avoiding the pointer >> and putting it into .rodata. >> Also use chars for interlaced and level

[FFmpeg-devel] [PATCH] checkasm: add linear least square tests

2024-05-28 Thread Rémi Denis-Courmont

--- tests/checkasm/Makefile | 1 + tests/checkasm/checkasm.c | 1 + tests/checkasm/checkasm.h | 1 + tests/checkasm/lls.c | 110 ++ tests/fate/checkasm.mak | 1 + 5 files changed, 114 insertions(+) create mode 100644 tests/checkasm/lls.c diff

[FFmpeg-devel] [PATCH 1/4] lavu/float_dsp: add double-precision scalar product

2024-05-29 Thread Rémi Denis-Courmont

The function pointer is appended to the structure for backward binary compatibility. Fortunately, this is allocated by libavutil, not by the user, so increasing the structure size is safe. --- doc/APIchanges| 3 +++ libavutil/float_dsp.c | 12 libavutil/float_dsp.h | 14 +

[FFmpeg-devel] [PATCH 2/4] lavf: get rid of bespoke double scalar products

2024-05-29 Thread Rémi Denis-Courmont

--- libavfilter/aap_template.c | 14 +- libavfilter/anlms_template.c | 16 ++-- libavfilter/arls_template.c | 14 +- 3 files changed, 4 insertions(+), 40 deletions(-) diff --git a/libavfilter/aap_template.c b/libavfilter/aap_template.c index ea9c815a89..0e05

[FFmpeg-devel] [PATCH 3/4] checkasm/float_dsp: add double-precision scalar product

2024-05-29 Thread Rémi Denis-Courmont

--- tests/checkasm/float_dsp.c | 19 +++ 1 file changed, 19 insertions(+) diff --git a/tests/checkasm/float_dsp.c b/tests/checkasm/float_dsp.c index cadfa65e2a..296db1cff9 100644 --- a/tests/checkasm/float_dsp.c +++ b/tests/checkasm/float_dsp.c @@ -278,6 +278,22 @@ static void tes

[FFmpeg-devel] [PATCH 4/4] lavc/float_dsp: R-V V scalarproduct_double

2024-05-29 Thread Rémi Denis-Courmont

C908: scalarproduct_double_c: 39.2 scalarproduct_double_rvv_f64: 10.5 X60: scalarproduct_double_c: 35.0 scalarproduct_double_rvv_f64: 5.2 --- libavutil/riscv/float_dsp_init.c | 3 +++ libavutil/riscv/float_dsp_rvv.S | 21 + 2 files changed, 24 insertions(+) dif

[FFmpeg-devel] [PATCH] lavu/lls: R-V V update_lls

2024-05-29 Thread Rémi Denis-Courmont

til/riscv/lls_init.c b/libavutil/riscv/lls_init.c new file mode 100644 index 00..b1317dc925 --- /dev/null +++ b/libavutil/riscv/lls_init.c @@ -0,0 +1,57 @@ +/* + * Copyright © 2024 Rémi Denis-Courmont. + * + * This file is part of FFmpeg. + * + * FFmpeg is free software; you can redistribute

[FFmpeg-devel] [PATCH] checkasm: add linear least square tests

2024-05-29 Thread Rémi Denis-Courmont

--- tests/checkasm/Makefile | 1 + tests/checkasm/checkasm.c | 1 + tests/checkasm/checkasm.h | 1 + tests/checkasm/lls.c | 101 ++ tests/fate/checkasm.mak | 1 + 5 files changed, 105 insertions(+) create mode 100644 tests/checkasm/lls.c diff

Re: [FFmpeg-devel] [PATCH 1/4] lavu/float_dsp: add double-precision scalar product

2024-05-29 Thread Rémi Denis-Courmont

alignment beyond that imposed by > double? The C and RISC-V implementations require the natural alignment of double, 64- bit. If somebody wants to increase the required alignment(s), they are free to audit the call sites and update the comments accordingly, since this won't be an ABI br

1 2 3 4 5 6 7 8 9 10 >

1 - 100 of 1371 matches

Mail list logo