Re: [FFmpeg-devel] libavcodec/bswapdsp : add AVX2 for bswap_buf

2017-10-29 Thread James Darnley
On 2017-10-28 17:57, Martin Vignali wrote: >>> For this, i follow Henrik Gramner's code, in the discussion >>> libavcodec/exr : add x86 SIMD for predictor. >> >> That's fine. I will look at that on my own time. I don't want to hold >> up applying the patch. Would you like me to push it for

Re: [FFmpeg-devel] libavcodec/bswapdsp : add AVX2 for bswap_buf

2017-10-28 Thread Martin Vignali
> >> > > For this, i follow Henrik Gramner's code, in the discussion > > libavcodec/exr : add x86 SIMD for predictor. > > That's fine. I will look at that on my own time. I don't want to hold > up applying the patch. Would you like me to push it for you? > > > Yes please push it if you can. Than

Re: [FFmpeg-devel] libavcodec/bswapdsp : add AVX2 for bswap_buf

2017-10-28 Thread James Darnley
On 2017-10-28 16:14, Martin Vignali wrote: > 2017-10-28 15:47 GMT+02:00 James Darnley : > >> On 2017-10-22 19:14, Martin Vignali wrote: >>> @@ -86,16 +101,20 @@ SECTION .text >>> >>> ; void ff_bswap_buf(uint32_t *dst, const uint32_t *src, int w); >>> %macro BSWAP32_BUF 0 >>> -%if cpuflag(ssse3)

Re: [FFmpeg-devel] libavcodec/bswapdsp : add AVX2 for bswap_buf

2017-10-28 Thread Martin Vignali
2017-10-28 15:47 GMT+02:00 James Darnley : > On 2017-10-22 19:14, Martin Vignali wrote: > > @@ -86,16 +101,20 @@ SECTION .text > > > > ; void ff_bswap_buf(uint32_t *dst, const uint32_t *src, int w); > > %macro BSWAP32_BUF 0 > > -%if cpuflag(ssse3) > > +%if cpuflag(ssse3)||cpuflag(avx2) > > cglo

Re: [FFmpeg-devel] libavcodec/bswapdsp : add AVX2 for bswap_buf

2017-10-28 Thread James Darnley
On 2017-10-22 19:14, Martin Vignali wrote: > @@ -86,16 +101,20 @@ SECTION .text > > ; void ff_bswap_buf(uint32_t *dst, const uint32_t *src, int w); > %macro BSWAP32_BUF 0 > -%if cpuflag(ssse3) > +%if cpuflag(ssse3)||cpuflag(avx2) > cglobal bswap32_buf, 3,4,3 > mov r3, r1 > +%if cpufl

Re: [FFmpeg-devel] libavcodec/bswapdsp : add AVX2 for bswap_buf

2017-10-28 Thread Martin Vignali
2017-10-25 9:39 GMT+02:00 Paul B Mahol : > On 10/22/17, Martin Vignali wrote: > > Hello, > > > > In attach patch in order to add an AVX2 version for bswap_buf > > (swap uint32 in a buffer) > > > > Checkasm result (Kaby Lake 10.12) > > bswap_buf_c: 122.8 > > bswap_buf_sse2: 67.8 > > bswap_buf_ssse

Re: [FFmpeg-devel] libavcodec/bswapdsp : add AVX2 for bswap_buf

2017-10-25 Thread Paul B Mahol
On 10/22/17, Martin Vignali wrote: > Hello, > > In attach patch in order to add an AVX2 version for bswap_buf > (swap uint32 in a buffer) > > Checkasm result (Kaby Lake 10.12) > bswap_buf_c: 122.8 > bswap_buf_sse2: 67.8 > bswap_buf_ssse3: 34.3 > bswap_buf_avx2: 21.0 > > > Pass checkasm and fate te

[FFmpeg-devel] libavcodec/bswapdsp : add AVX2 for bswap_buf

2017-10-22 Thread Martin Vignali
Hello, In attach patch in order to add an AVX2 version for bswap_buf (swap uint32 in a buffer) Checkasm result (Kaby Lake 10.12) bswap_buf_c: 122.8 bswap_buf_sse2: 67.8 bswap_buf_ssse3: 34.3 bswap_buf_avx2: 21.0 Pass checkasm and fate test for me Martin Jokyo Images 0001-libavcodec-bswapdsp