On 2017-10-28 17:57, Martin Vignali wrote:
>>> For this, I followed Henrik Gramner's code from the discussion
>>> libavcodec/exr : add x86 SIMD for predictor.
>>
>> That's fine. I will look at that on my own time. I don't want to hold
>> up applying the patch. Would you like me to push it for
> >>
> > For this, I followed Henrik Gramner's code from the discussion
> > libavcodec/exr : add x86 SIMD for predictor.
>
> That's fine. I will look at that on my own time. I don't want to hold
> up applying the patch. Would you like me to push it for you?
>
>
> Yes please push it if you can.
Than
On 2017-10-28 16:14, Martin Vignali wrote:
> 2017-10-28 15:47 GMT+02:00 James Darnley:
>
>> On 2017-10-22 19:14, Martin Vignali wrote:
>>> @@ -86,16 +101,20 @@ SECTION .text
>>>
>>> ; void ff_bswap_buf(uint32_t *dst, const uint32_t *src, int w);
>>> %macro BSWAP32_BUF 0
>>> -%if cpuflag(ssse3)
2017-10-28 15:47 GMT+02:00 James Darnley:
> On 2017-10-22 19:14, Martin Vignali wrote:
> > @@ -86,16 +101,20 @@ SECTION .text
> >
> > ; void ff_bswap_buf(uint32_t *dst, const uint32_t *src, int w);
> > %macro BSWAP32_BUF 0
> > -%if cpuflag(ssse3)
> > +%if cpuflag(ssse3)||cpuflag(avx2)
> > cglo
On 2017-10-22 19:14, Martin Vignali wrote:
> @@ -86,16 +101,20 @@ SECTION .text
>
> ; void ff_bswap_buf(uint32_t *dst, const uint32_t *src, int w);
> %macro BSWAP32_BUF 0
> -%if cpuflag(ssse3)
> +%if cpuflag(ssse3)||cpuflag(avx2)
> cglobal bswap32_buf, 3,4,3
> mov r3, r1
> +%if cpufl
2017-10-25 9:39 GMT+02:00 Paul B Mahol:
> On 10/22/17, Martin Vignali wrote:
> > Hello,
> >
> > Attached is a patch that adds an AVX2 version of bswap_buf
> > (swaps the bytes of each uint32 in a buffer)
> >
> > Checkasm result (Kaby Lake 10.12)
> > bswap_buf_c: 122.8
> > bswap_buf_sse2: 67.8
> > bswap_buf_ssse
On 10/22/17, Martin Vignali wrote:
> Hello,
>
> Attached is a patch that adds an AVX2 version of bswap_buf
> (swaps the bytes of each uint32 in a buffer)
>
> Checkasm result (Kaby Lake 10.12)
> bswap_buf_c: 122.8
> bswap_buf_sse2: 67.8
> bswap_buf_ssse3: 34.3
> bswap_buf_avx2: 21.0
>
>
> Pass checkasm and fate te
Hello,
Attached is a patch that adds an AVX2 version of bswap_buf
(swaps the bytes of each uint32 in a buffer)
Checkasm result (Kaby Lake 10.12)
bswap_buf_c: 122.8
bswap_buf_sse2: 67.8
bswap_buf_ssse3: 34.3
bswap_buf_avx2: 21.0
Passes checkasm and the FATE tests for me.
Martin
Jokyo Images
0001-libavcodec-bswapdsp
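
For readers following the thread, here is a minimal scalar sketch in C of what a bswap_buf routine computes. It uses the same argument order as the prototype quoted in the patch (dst, src, w). The names bswap32_ref and bswap_buf_ref are hypothetical and chosen only for illustration. This is not the code from the patch and not necessarily how the C fallback in libavcodec is written.

```c
#include <stdint.h>

/* Hypothetical helper (name is illustrative): reverse the byte order
 * of a single 32-bit word, e.g. 0x11223344 becomes 0x44332211. */
static uint32_t bswap32_ref(uint32_t x)
{
    return (x >> 24) |
           ((x >> 8) & 0x0000FF00u) |
           ((x << 8) & 0x00FF0000u) |
           (x << 24);
}

/* Hypothetical scalar reference: byte-swap w 32-bit words from src
 * into dst. Argument order mirrors the prototype quoted in the patch:
 *   void ff_bswap_buf(uint32_t *dst, const uint32_t *src, int w); */
static void bswap_buf_ref(uint32_t *dst, const uint32_t *src, int w)
{
    for (int i = 0; i < w; i++)
        dst[i] = bswap32_ref(src[i]);
}
```

A SIMD version does the same work on several words per iteration, so a scalar loop like this one is a convenient reference when comparing outputs, which is roughly the kind of check checkasm performs.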