Re: [FFmpeg-devel] [PATCH] x86/me_cmp: port mmxext and sse2 sad functions to yasm

2014-09-17 Thread Michael Niedermayer
On Tue, Sep 16, 2014 at 09:41:47PM -0300, James Almer wrote: > Also add a missing c->pix_abs[0][0] initialization, and sse2 versions of > sad16_x2, sad16_y2 and sad16_xy2 (%15 to %20 faster than mmxext). > Since the _xy2 versions are not bitexact, they are accordingly marked as > approximate. > >

[FFmpeg-devel] [PATCH] x86/me_cmp: port mmxext and sse2 sad functions to yasm

2014-09-16 Thread James Almer
Also add a missing c->pix_abs[0][0] initialization, and sse2 versions of sad16_x2, sad16_y2 and sad16_xy2 (%15 to %20 faster than mmxext). Since the _xy2 versions are not bitexact, they are accordingly marked as approximate. Signed-off-by: James Almer --- libavcodec/x86/me_cmp.asm| 330 +

Re: [FFmpeg-devel] [PATCH] x86/me_cmp: port mmxext and sse2 sad functions to yasm

2014-09-14 Thread Michael Niedermayer
On Sun, Sep 14, 2014 at 07:35:26PM -0300, James Almer wrote: > On 14/09/14 7:12 PM, Michael Niedermayer wrote: > > On Sat, Sep 13, 2014 at 10:12:12PM -0300, James Almer wrote: > >> Also add a missing c->pix_abs[0][0] initialization, and sse2 versions of > >> sad16_x2, sad16_y2 and sad16_xy2. > >> S

Re: [FFmpeg-devel] [PATCH] x86/me_cmp: port mmxext and sse2 sad functions to yasm

2014-09-14 Thread James Almer
On 14/09/14 7:12 PM, Michael Niedermayer wrote: > On Sat, Sep 13, 2014 at 10:12:12PM -0300, James Almer wrote: >> Also add a missing c->pix_abs[0][0] initialization, and sse2 versions of >> sad16_x2, sad16_y2 and sad16_xy2. >> Since the _xy2 versions are not bitexact, they are accordingly marked as

Re: [FFmpeg-devel] [PATCH] x86/me_cmp: port mmxext and sse2 sad functions to yasm

2014-09-14 Thread Michael Niedermayer
On Sat, Sep 13, 2014 at 10:12:12PM -0300, James Almer wrote: > Also add a missing c->pix_abs[0][0] initialization, and sse2 versions of > sad16_x2, sad16_y2 and sad16_xy2. > Since the _xy2 versions are not bitexact, they are accordingly marked as > approximate. > > Signed-off-by: James Almer > --

[FFmpeg-devel] [PATCH] x86/me_cmp: port mmxext and sse2 sad functions to yasm

2014-09-13 Thread James Almer
Also add a missing c->pix_abs[0][0] initialization, and sse2 versions of sad16_x2, sad16_y2 and sad16_xy2. Since the _xy2 versions are not bitexact, they are accordingly marked as approximate. Signed-off-by: James Almer --- Not benched. libavcodec/x86/me_cmp.asm| 229 +++