Hello, Le maanantaina 19. helmikuuta 2024, 13.13.43 EET flow gg a écrit : > The reason for using m1+le8 instead of stride load + larger group > multipliers is the same as in "[FFmpeg-devel] [PATCH 1/7] lavc/me_cmp: R-V > V pix_abs." > > In the test, there is > > #define src (buf + 2 * SRC_BUF_STRIDE + 2 + 1) > > Therefore, not using e8 will result : (fatal signal 7: Bus error).
Yes, you could also just say that alignment is insufficient :) It is still possible to load rectangles of up to 8 columns using vlseg8e8, but it might be slower than just repeating the 8 regular loads, and it won't work if you need calculations between rows. I may be missing something but I don't understand what purpose the header file serves here? -- Rémi Denis-Courmont http://www.remlab.net/ _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email ffmpeg-devel-requ...@ffmpeg.org with subject "unsubscribe".