[FFmpeg-devel] [PATCH] Revised ff_v210_planar_unpack AVX2

2019-03-12 Thread Michael Stoner
Replaced VSHUFPS with VPBLENDD to relieve port 5 bottleneck AVX2 is now 1.4x faster than AVX Tested on Broadwell CPU, Ubuntu 18.10 x86_64 ~/FFmpeg$ tests/checkasm/checkasm --bench --test=v210dec benchmarking with native FFmpeg timers nop: 94.1 checkasm: using random seed 3963743306 SSSE3: - v210

Re: [FFmpeg-devel] [PATCH] Revised ff_v210_planar_unpack AVX2

2019-03-12 Thread Mike Stoner
I am submitting another patch.  Please disregard this one. -Mike ___ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

[FFmpeg-devel] [PATCH] Revised ff_v210_planar_unpack AVX2

2019-03-06 Thread Michael Stoner
--- libavcodec/v210dec.c | 10 +- libavcodec/x86/v210-init.c | 8 + libavcodec/x86/v210.asm| 63 -- 3 files changed, 64 insertions(+), 17 deletions(-) diff --git a/libavcodec/v210dec.c b/libavcodec/v210dec.c index ddc5dbe8be..26954c0df3 10064