Re: [FFmpeg-devel] [PATCH v2 2/3] swscale/x86/output: add AVX2 version of yuv2nv12cX

2020-04-24 Thread James Almer
On 4/24/2020 1:31 PM, Nelson Gomez wrote: > From: Nelson Gomez > > 256 bits is just wide enough to fit all the operands needed to vectorize > the software implementation, but AVX2 is needed to for a couple of > instructions like cross-lane permutation. > > Output is bit-for-bit identical to C. >

[FFmpeg-devel] [PATCH v2 2/3] swscale/x86/output: add AVX2 version of yuv2nv12cX

2020-04-24 Thread Nelson Gomez
From: Nelson Gomez 256 bits is just wide enough to fit all the operands needed to vectorize the software implementation, but AVX2 is needed to for a couple of instructions like cross-lane permutation. Output is bit-for-bit identical to C. Signed-off-by: Nelson Gomez --- libswscale/x86/output.