On 2016-01-14 21:42, Henrik Gramner wrote:
> On Thu, Jan 14, 2016 at 9:27 PM, James Darnley wrote:
>> On 2016-01-14 20:21, Henrik Gramner wrote:
>>> xmN can be used unconditionally which gets rid of the %else. E.g.
>>>
>>> movu xm1, [yq+widthq*2]
>>> %if cpuflag(avx2)
>>> vinserti
On Thu, Jan 14, 2016 at 9:27 PM, James Darnley wrote:
> On 2016-01-14 20:21, Henrik Gramner wrote:
>> xmN can be used unconditionally which gets rid of the %else. E.g.
>>
>> movu xm1, [yq+widthq*2]
>> %if cpuflag(avx2)
>> vinserti128 m1, m1, [yq+widthq*2+12], 1
>> %endif
>
> I can ch
On 2016-01-14 20:21, Henrik Gramner wrote:
> On Wed, Jan 13, 2016 at 4:55 PM, James Darnley wrote:
>> diff --git a/libavcodec/x86/v210enc.asm b/libavcodec/x86/v210enc.asm
>> index 859e2d9..a8f3d3c 100644
>> --- a/libavcodec/x86/v210enc.asm
>> +++ b/libavcodec/x86/v210enc.asm
>> -cextern pb_FE
>
On Wed, Jan 13, 2016 at 4:55 PM, James Darnley wrote:
> diff --git a/libavcodec/x86/v210enc.asm b/libavcodec/x86/v210enc.asm
> index 859e2d9..a8f3d3c 100644
> --- a/libavcodec/x86/v210enc.asm
> +++ b/libavcodec/x86/v210enc.asm
> -cextern pb_FE
> -%define v210_enc_max_8 pb_FE
> +;cextern pb_FE
> +l
Around 35% faster than the avx version.
---
libavcodec/v210enc.c | 5 ++--
libavcodec/v210enc.h | 1 +
libavcodec/x86/v210enc.asm | 53 +++
libavcodec/x86/v210enc_init.c | 7 ++
4 files changed, 49 insertions(+), 17 deletions(-)