On Tue, 20 Sep 2022, Hubert Mazur wrote:
> This fixes issues addressed in previous patchset:
> - move sub instruction in vsad8_intra,
> - remove unnecessary mov instructions,
> - remove single lane extraction in loop and place it at the end.
>
> Removing mov instructions from pix_median_abs functions significantly
> increased performance for both.

I'm quite sure that it wasn't the removed mov instructions that improved
performance (those instructions should be essentially free, they're just
misleading), but the fact that you got rid of the extra single-element
handling within the loop.
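
To illustrate the difference, here is a minimal sketch in plain C (not the
actual NEON code from the patch). The function names and the LANES constant
are made up for this example, and the inner "lane" loops only stand in for
vector instructions. The first function reduces to a scalar in every
iteration, the second keeps per-lane accumulators and reduces once after the
loop; both return the same sum.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

#define LANES 8 /* hypothetical vector width, in elements */

/* Reduces the lanes to a scalar inside every iteration, which is the
 * pattern that adds extra per-iteration work. */
static unsigned sad_reduce_in_loop(const uint8_t *a, const uint8_t *b, int n)
{
    unsigned total = 0;
    assert(n % LANES == 0);
    for (int i = 0; i < n; i += LANES) {
        unsigned lanes[LANES];
        for (int l = 0; l < LANES; l++)
            lanes[l] = (unsigned)abs(a[i + l] - b[i + l]);
        for (int l = 0; l < LANES; l++) /* horizontal add, every iteration */
            total += lanes[l];
    }
    return total;
}

/* Keeps one accumulator per lane and reduces only once, after the loop. */
static unsigned sad_reduce_at_end(const uint8_t *a, const uint8_t *b, int n)
{
    unsigned acc[LANES] = { 0 };
    unsigned total = 0;
    assert(n % LANES == 0);
    for (int i = 0; i < n; i += LANES)
        for (int l = 0; l < LANES; l++)
            acc[l] += (unsigned)abs(a[i + l] - b[i + l]);
    for (int l = 0; l < LANES; l++) /* horizontal add, once */
        total += acc[l];
    return total;
}
```

In real SIMD code the per-lane accumulation is a single vector instruction,
so moving the horizontal add out of the loop removes work from every
iteration without changing the result.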
Anyway, the patches seem fine to me now, so I'll push them. Thanks!
// Martin