On Sat, Jul 26, 2014 at 11:50:05AM +0200, Mickaël Raulet wrote:
> Hi,
> 
> > Only 8-bit and 10-bit idct_dc() functions are included (adding others 
> > should be trivial).
> > 
> > Benchmarks on an Intel Core i5-4200U:
> > 
> > idct8x8_dc
> >       SSE2   MMXEXT  C
> > cycles 22     26      57
> > 
> > idct16x16_dc
> >       AVX2   SSE2    C
> > cycles 27     32      249
> > 
> > idct32x32_dc
> >       AVX2   SSE2    C
> > cycles 62     126     1375
> > 
> > Signed-off-by: James Almer <jamr...@gmail.com>
> > ---
> > libavcodec/hevcdsp.h          |   2 -
> > libavcodec/x86/hevc_idct.asm  | 249 
> > +++++++++++-------------------------------
> > libavcodec/x86/hevcdsp.h      |  28 -----
> > libavcodec/x86/hevcdsp_init.c |  95 +++++-----------
> > libavutil/x86/x86util.asm     |   4 +-
> > 5 files changed, 93 insertions(+), 285 deletions(-)
> > 
> 
> patch ok. we are currently working in transform_add.

applied

thanks

[...]
-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

Asymptotically faster algorithms should always be preferred if you have
asymptotical amounts of data

Attachment: signature.asc
Description: Digital signature

_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
http://ffmpeg.org/mailman/listinfo/ffmpeg-devel

Reply via email to