On Mon, Oct 12, 2015 at 07:37:47PM +0200, Christophe Gisquet wrote:
> Modeled from the prores version. Clips to [0;1023] and is bitexact.
> Bitexactness requires to add offsets in different places compared to
> prores or C, and makes the function approximately 2% slower.
>
> For 16 frames of a DNxHD 4:2:2 10bits test sequence:
>
> C:    60861 decicycles in idct, 1048205 runs, 371 skips
> sse2: 27567 decicycles in idct, 1048216 runs, 360 skips
> avx:  26272 decicycles in idct, 1048171 runs, 405 skips
>
> The pure and add versions are not implemented, so the corresponding dsp
> functions are set to NULL to make it clear in a code executing them.
> ---
>  libavcodec/x86/Makefile          |  1 +
>  libavcodec/x86/idctdsp_init.c    | 20 +++++++++++++++
>  libavcodec/x86/simple_idct.h     |  3 +++
>  libavcodec/x86/simple_idct10.asm | 54 ++++++++++++++++++++++++++++++++++++++++
>  4 files changed, 78 insertions(+)
>  create mode 100644 libavcodec/x86/simple_idct10.asm
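
For readers following along, here is a minimal C sketch of what "Clips to [0;1023]" amounts to for a 10-bit output. It is not the code from the patch: the names `clip_pixel10` and `put_clipped_8x8_10` are hypothetical, the stride is assumed to be counted in `uint16_t` elements, and the block is assumed to have been inverse-transformed already.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: clamp one reconstructed sample to the 10-bit
 * range [0, 1023]. */
static inline uint16_t clip_pixel10(int v)
{
    if (v < 0)
        return 0;
    if (v > 1023)
        return 1023;
    return (uint16_t)v;
}

/* Hypothetical store step: write an 8x8 block of already-transformed
 * samples to a 16-bit destination, clipping each one.
 * Assumption: stride is given in uint16_t elements, not bytes. */
static void put_clipped_8x8_10(uint16_t *dst, ptrdiff_t stride,
                               const int16_t *block)
{
    for (int y = 0; y < 8; y++) {
        for (int x = 0; x < 8; x++)
            dst[x] = clip_pixel10(block[y * 8 + x]);
        dst += stride;
    }
}
```

A SIMD version would typically replace the per-sample branches with packed min/max instructions, but the result per sample should be the same as this scalar form.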

applied

thanks

[...]
-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

Freedom in capitalist society always remains about the same as it was in
ancient Greek republics: Freedom for slave owners. -- Vladimir Lenin
_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
http://ffmpeg.org/mailman/listinfo/ffmpeg-devel