Hello,

On Monday, 19 February 2024, 13:13:43 EET, flow gg wrote:
> The reason for using m1+le8 instead of stride load + larger group
> multipliers is the same as in "[FFmpeg-devel] [PATCH 1/7] lavc/me_cmp: R-V
> V pix_abs."
> 
> In the test, there is
> 
> #define src (buf + 2 * SRC_BUF_STRIDE + 2 + 1)
> 
> Therefore, not using e8 will result in a crash (fatal signal 7: Bus error).

Yes, you could also just say that alignment is insufficient :)
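
To spell the alignment point out, here is a minimal C sketch of the offset 
arithmetic behind the quoted macro. The stride value of 32 and the helper 
names are assumptions made for illustration, and buf itself is assumed to be 
aligned to at least the element size. Whatever the stride, 2 * stride + 2 is 
even, so adding 1 always gives an odd byte offset:

```c
#include <assert.h>
#include <stddef.h>

/* Assumed value, for illustration only; the parity argument holds for any stride. */
#define SRC_BUF_STRIDE 32

/* Byte offset of src relative to buf, mirroring the macro in the test. */
static size_t src_offset(void)
{
    return 2 * SRC_BUF_STRIDE + 2 + 1;
}

/* Returns 1 if an element of elem_size bytes at the given offset is
 * naturally aligned, assuming the base pointer is aligned to elem_size. */
static int is_aligned(size_t offset, size_t elem_size)
{
    assert(elem_size != 0);
    return offset % elem_size == 0;
}
```

With these assumptions, only byte-sized elements (e8) are naturally aligned at 
src; any wider element width lands on a misaligned address.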

It is still possible to load rectangles of up to 8 columns using vlseg8e8, but 
it might be slower than just repeating the 8 regular loads, and it won't work 
if you need calculations between rows.
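
As a rough illustration of the data layout, here is a scalar C model of an 
8-field, byte-element segment load. It assumes the strided form of the 
instruction, with the stride equal to the line size, so that each segment is 
one row of the rectangle. The function name, the 16-element register length 
and the array shapes are hypothetical:

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define NFIELDS 8   /* fields per segment, i.e. columns of the rectangle */
#define MAX_VL  16  /* hypothetical number of elements per register */

/* Scalar model: register f receives column f, one element per row. */
static void model_seg8e8(uint8_t regs[NFIELDS][MAX_VL], const uint8_t *base,
                         ptrdiff_t stride, size_t rows)
{
    assert(rows <= MAX_VL);
    for (size_t i = 0; i < rows; i++)         /* element index = row */
        for (size_t f = 0; f < NFIELDS; f++)  /* register index = column */
            regs[f][i] = base[(ptrdiff_t)i * stride + (ptrdiff_t)f];
}
```

In this model every register holds one column, with consecutive rows in 
consecutive elements. Arithmetic between rows would then need to combine 
different elements of the same register, which is one way to read the caveat 
above.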

I may be missing something, but I don't understand what purpose the header 
file serves here.

-- 
Rémi Denis-Courmont
http://www.remlab.net/


