On 2/20/2017 12:33 PM, James Darnley wrote:
> 6 more functions which eke out a little more speed.
> 
> James Darnley (6):
>   avcodec/h264: add avx 8-bit chroma v deblock/loop filter
>   avcodec/h264: add avx 8-bit 4:2:0 chroma h deblock/loop filter
>   avcodec/h264: add avx 8-bit 4:2:2 chroma h deblock/loop filter
>   avcodec/h264: add avx 8-bit chroma v intra deblock/loop filter
>   avcodec/h264: add avx 8-bit 4:2:0 chroma h intra deblock/loop filter
>   avcodec/h264: add avx 8-bit 4:2:2 chroma h intra deblock/loop filter
> 
>  libavcodec/x86/h264_deblock.asm | 195 
> ++++++++++++++++++++++++++++++++++++++++
>  libavcodec/x86/h264dsp_init.c   |  10 +++
>  2 files changed, 205 insertions(+)

All of these can and should be SSE2 first and foremost, and AVX only if
there's any gain on top of it from using the three operand version of the
instructions.

_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
http://ffmpeg.org/mailman/listinfo/ffmpeg-devel

Reply via email to