Hello, The following patchset add the ff_simple_idct function neon functions for the aarch64 platform. It's ported from armv7 simple_idct_neon with some improvements: * the source idct blocks are now loaded once and kept in v24-v31 * the source idct blocks are no longer overriden in idct_col4_top * the destination is now written in one pass at the end of ff_simple_idct{,_put,_add}_neon
It is bitexact with the armv7 neon implementation. Here are some results (reported by {START,STOP}_TIMER) on an Odroid-C2 (Cortex A53): Functions IDCT: simple IDCT: simpleneon ff_simple_idct_put 9795 units 3170 units ff_simple_idct_add 10227 units 3302 units Matthieu _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org http://ffmpeg.org/mailman/listinfo/ffmpeg-devel