As of 459a1512f13ae1fdd404ff9281b02d225ac70a09, the code is unrolled to
process two rows per iteration. The output cursor thus needs to be
incremented by twice the stride, which SH1ADD takes care of. However, the
ADD from the original implementation was incorrectly left in place, so
the cursor advanced by three strides per iteration instead of two.
---
 libavcodec/riscv/h264dsp_rvv.S | 1 -
 1 file changed, 1 deletion(-)
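
Note: the pointer arithmetic described above can be sketched in C. This is
only a rough model of the cursor update, not the actual routine: the helper
names (sh1add, final_offset, leftover_add) are hypothetical, and it assumes
a1 holds the stride and that the loop runs once per pair of rows.

```c
#include <stddef.h>

/* Models the Zba instruction "sh1add rd, rs1, rs2": rd = (rs1 << 1) + rs2. */
static size_t sh1add(size_t rs1, size_t rs2)
{
    return (rs1 << 1) + rs2;
}

/*
 * Hypothetical helper: returns the output cursor offset after `height`
 * rows have been processed two at a time. `leftover_add` selects whether
 * the stray ADD removed by this patch is still executed.
 */
static size_t final_offset(size_t stride, int height, int leftover_add)
{
    size_t a0 = 0;                      /* output cursor, as a byte offset */

    for (int rows = height; rows > 0; rows -= 2) {
        a0 = sh1add(stride, a0);        /* sh1add a0, a1, a0 */
        if (leftover_add)
            a0 += stride;               /* add a0, a0, a1 (removed line) */
    }
    return a0;
}
```

With a stride of 16 and 8 rows, this model ends at offset 128 (8 rows times
the stride) without the stray ADD, and at 192 with it.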

diff --git a/libavcodec/riscv/h264dsp_rvv.S b/libavcodec/riscv/h264dsp_rvv.S
index 44f6202602..5423b716ca 100644
--- a/libavcodec/riscv/h264dsp_rvv.S
+++ b/libavcodec/riscv/h264dsp_rvv.S
@@ -52,7 +52,6 @@ func ff_h264_weight_pixels_simple_8_rvv, zve32x, b
         vse8.v  v8, (a0)
         vse8.v  v9, (t0)
         sh1add  a0, a1, a0
-        add     a0, a0, a1
         bnez    a2, 1b
 
         ret
-- 
2.45.2