On Thu, 29 Jun 2023, John Cox wrote:

Also adds a filter_line3 method which on aarch64 neon yields approx 30%
speedup over 2xfilter_line and a memcpy

John Cox (15):
 avfilter/vf_bwdif: Add outline for aarch neon functions
 avfilter/vf_bwdif: Add common macros and consts for aarch64 neon
 avfilter/vf_bwdif: Export C filter_intra
 avfilter/vf_bwdif: Add neon for filter_intra
 tests/checkasm: Add test for vf_bwdif filter_intra
 avfilter/vf_bwdif: Add clip and spatial macros for aarch64 neon
 avfilter/vf_bwdif: Export C filter_edge
 avfilter/vf_bwdif: Add neon for filter_edge
 tests/checkasm: Add test for vf_bwdif filter_edge
 avfilter/vf_bwdif: Export C filter_line
 avfilter/vf_bwdif: Add neon for filter_line
 avfilter/vf_bwdif: Add a filter_line3 method for optimisation
 avfilter/vf_bwdif: Add neon for filter_line3
 tests/checkasm: Add test for vf_bwdif filter_line3
 avfilter/vf_bwdif: Block filter slices into a multiple of 4 lines

It's nice to have this split up in small easily checkable patches, but this is perhaps a bit more finegrained than what's usual. But I guess that's ok...

I'll comment on the patches that need commenting on.

// Martin

_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
ffmpeg-devel-requ...@ffmpeg.org with subject "unsubscribe".

Reply via email to