On Sun, Aug 18, 2024 at 10:13 PM Ramiro Polla <ramiro.po...@gmail.com> wrote:
>
>                    A53                A76
> pix_norm1_c:     519.2              231.5
> pix_norm1_neon:  195.0 ( 2.66x)      44.2 ( 5.24x)
> pix_sum_c:       344.5              242.2
> pix_sum_neon:    119.0 ( 2.89x)      41.7 ( 5.81x)

This new patchset no longer uses unrolled loops. Even though checkasm reported the unrolled versions to be faster, in a real encoding use-case linux perf reports that the non-unrolled versions are faster.
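
For readers who do not have the code at hand, here is a rough scalar sketch of what the two benchmarked functions compute, plus a variant with the row loop unrolled by two to show what "unrolled" means here. This is not the NEON code from the patchset. The `_sketch` function names are hypothetical, and the 16x16 block size and byte stride are assumptions about the usual form of these helpers.

```c
#include <stddef.h>
#include <stdint.h>

/* Sum of all pixels in a 16x16 block of 8-bit samples.
 * `stride` is the distance in bytes between consecutive rows (assumed). */
static int pix_sum_sketch(const uint8_t *pix, ptrdiff_t stride)
{
    int sum = 0;
    for (int y = 0; y < 16; y++) {
        for (int x = 0; x < 16; x++)
            sum += pix[x];
        pix += stride;
    }
    return sum;
}

/* Sum of squared pixels over the same 16x16 block.
 * The maximum is 256 * 255 * 255, which fits in a 32-bit int. */
static int pix_norm1_sketch(const uint8_t *pix, ptrdiff_t stride)
{
    int sum = 0;
    for (int y = 0; y < 16; y++) {
        for (int x = 0; x < 16; x++)
            sum += pix[x] * pix[x];
        pix += stride;
    }
    return sum;
}

/* Same result as pix_sum_sketch, but two rows are handled per iteration.
 * Illustration only: fewer loop iterations, more code per iteration. */
static int pix_sum_unrolled2_sketch(const uint8_t *pix, ptrdiff_t stride)
{
    int sum = 0;
    for (int y = 0; y < 16; y += 2) {
        const uint8_t *row0 = pix;
        const uint8_t *row1 = pix + stride;
        for (int x = 0; x < 16; x++)
            sum += row0[x] + row1[x];
        pix += 2 * stride;
    }
    return sum;
}
```

Both forms return the same value. Unrolling only changes how the work is split across loop iterations, and so changes code size and branch count. That kind of difference can look different in an isolated microbenchmark than inside a full encode.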