Here is my second work-in-progress patch set. These are the functions from the second half of my previous set.
With Ronald's rounding contribution I am much closer to finishing. As I mentioned on IRC, I think the speed of decoding an MPEG-2 HD sample has increased from 215 to 235 fps, which is roughly a 9% improvement.

I still have a small optimisation to make, and I need to use the correct coefficients. That will require a large change to the macros.

I am sending this now so that people can nitpick my changes.

James Darnley (5):
  avcodec/x86: cleanup simple_idct10
  avcodec/x86: add x86-64 8-bit simple_idct function
  more cleanup
  avcodec/x86: add x86-64 8-bit simple_idct put function
  avcodec/x86: add x86-64 8-bit simple_idct add function

 libavcodec/tests/x86/dct.c                |   2 +
 libavcodec/x86/idctdsp_init.c             |  23 +++++++
 libavcodec/x86/proresdsp.asm              |   2 +-
 libavcodec/x86/simple_idct.h              |   9 +++
 libavcodec/x86/simple_idct10.asm          | 100 ++++++++++++++++++++++++++++--
 libavcodec/x86/simple_idct10_template.asm |  80 +++++++++++-------------
 6 files changed, 167 insertions(+), 49 deletions(-)

--
2.13.0

_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
http://ffmpeg.org/mailman/listinfo/ffmpeg-devel