On Sun, Aug 18, 2024 at 10:13 PM Ramiro Polla <ramiro.po...@gmail.com> wrote:
>
>                    A53             A76
> pix_norm1_c:     519.2           231.5
> pix_norm1_neon:  195.0 ( 2.66x)   44.2 ( 5.24x)
> pix_sum_c:       344.5           242.2
> pix_sum_neon:    119.0 ( 2.89x)   41.7 ( 5.81x)

This new patchset no longer uses unrolled loops. Even though checkasm
reported the unrolled versions to be faster, in a real encoding
use-case linux perf reports that the non-unrolled versions are faster.
_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
ffmpeg-devel-requ...@ffmpeg.org with subject "unsubscribe".

Reply via email to