On Mon, Oct 12, 2015 at 07:37:47PM +0200, Christophe Gisquet wrote:
> Modeled on the prores version. Clips to [0;1023] and is bitexact.
> Bitexactness requires adding the offsets in different places than in
> prores or C, which makes the function approximately 2% slower.
> 
> For 16 frames of a DNxHD 4:2:2 10-bit test sequence:
> 
> C:    60861 decicycles in idct, 1048205 runs,    371 skips
> sse2: 27567 decicycles in idct, 1048216 runs,    360 skips
> avx:  26272 decicycles in idct, 1048171 runs,    405 skips
> 
> The pure and add versions are not implemented, so the corresponding dsp
> function pointers are set to NULL to make it obvious if any code tries
> to execute them.
> ---
>  libavcodec/x86/Makefile          |  1 +
>  libavcodec/x86/idctdsp_init.c    | 20 +++++++++++++++
>  libavcodec/x86/simple_idct.h     |  3 +++
>  libavcodec/x86/simple_idct10.asm | 54 ++++++++++++++++++++++++++++++++++++++++
>  4 files changed, 78 insertions(+)
>  create mode 100644 libavcodec/x86/simple_idct10.asm

applied

thanks

[...]
-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

Freedom in capitalist society always remains about the same as it was in
ancient Greek republics: Freedom for slave owners. -- Vladimir Lenin


