Re: [FFmpeg-devel] [PATCH] lavfi/removegrain: add x86 and x86_64 SSE2 functions

2015-07-14 Thread James Almer
On 14/07/15 8:35 PM, James Darnley wrote:
> Speed of all modes increased by a factor between 7.4 and 19.8, largely depending
> on whether bytes are unpacked into words. Modes 2, 3, and 4 have been sped up
> by a factor of 43 (thanks quick sort!)
>
> All modes are available on x86_64 but only mo

[FFmpeg-devel] [PATCH] lavfi/removegrain: add x86 and x86_64 SSE2 functions

2015-07-14 Thread James Darnley
Speed of all modes increased by a factor between 7.4 and 19.8, largely depending on whether bytes are unpacked into words. Modes 2, 3, and 4 have been sped up by a factor of 43 (thanks quick sort!) All modes are available on x86_64 but only modes 1, 10, 11, 12, 13, 14, 19, 20, 21, and 22 are avail

Re: [FFmpeg-devel] [PATCH] lavfi/removegrain: add x86 and x86_64 SSE2 functions

2015-07-14 Thread James Darnley
On 2015-07-14 23:23, James Almer wrote:
> On 14/07/15 3:54 PM, James Darnley wrote:
>> On 2015-07-11 18:34, James Almer wrote:
>>> On 11/07/15 10:40 AM, James Darnley wrote:
new file mode 100644
index 000..5e1feea
--- /dev/null
+++ b/libavfilter/x86/vf_removegrain.asm
@

Re: [FFmpeg-devel] [PATCH] lavfi/removegrain: add x86 and x86_64 SSE2 functions

2015-07-14 Thread James Almer
On 14/07/15 3:54 PM, James Darnley wrote:
> On 2015-07-11 18:34, James Almer wrote:
>> On 11/07/15 10:40 AM, James Darnley wrote:
>>> @@ -566,7 +554,19 @@ static int filter_slice(AVFilterContext *ctx, void *arg, int jobnr, int nb_jobs)
>>> }
>>>
>>> *dst++ = *src++;
>>> -

Re: [FFmpeg-devel] [PATCH] lavfi/removegrain: add x86 and x86_64 SSE2 functions

2015-07-14 Thread James Darnley
On 2015-07-11 18:34, James Almer wrote:
> On 11/07/15 10:40 AM, James Darnley wrote:
>> @@ -566,7 +554,19 @@ static int filter_slice(AVFilterContext *ctx, void *arg, int jobnr, int nb_jobs)
>> }
>>
>> *dst++ = *src++;
>> -for (x = 1; x < s->planewidth[i] - 1; x++) {

Re: [FFmpeg-devel] [PATCH] lavfi/removegrain: add x86 and x86_64 SSE2 functions

2015-07-11 Thread James Almer
On 11/07/15 10:40 AM, James Darnley wrote:
> Speed of all modes increased by a factor between 7.4 and 19.8, largely depending
> on whether bytes are unpacked into words. Modes 2, 3, and 4 have been sped up
> by a factor of 43 (thanks quick sort!)
>
> All modes are available on x86_64 but only m

[FFmpeg-devel] [PATCH] lavfi/removegrain: add x86 and x86_64 SSE2 functions

2015-07-11 Thread James Darnley
Speed of all modes increased by a factor between 7.4 and 19.8, largely depending on whether bytes are unpacked into words. Modes 2, 3, and 4 have been sped up by a factor of 43 (thanks quick sort!) All modes are available on x86_64 but only modes 1, 10, 11, 12, 13, 14, 19, 20, 21, and 22 are avail

Re: [FFmpeg-devel] [PATCH] lavfi/removegrain: add x86 and x86_64 SSE2 functions

2015-07-11 Thread James Darnley
On 2015-07-11 15:45, Carl Eugen Hoyos wrote:
> James Darnley gmail.com> writes:
>
>> +;* TODO: gpl text goes here.
>
> Yes, please copy it from another asm file.
>
> And please make sure that the file does not get compiled if --enable-gpl was not specified.
Dammit! Of course I forgot to

Re: [FFmpeg-devel] [PATCH] lavfi/removegrain: add x86 and x86_64 SSE2 functions

2015-07-11 Thread Carl Eugen Hoyos
James Darnley gmail.com> writes:
> +;* TODO: gpl text goes here.
Yes, please copy it from another asm file.
And please make sure that the file does not get compiled if --enable-gpl was not specified.
Carl Eugen

[FFmpeg-devel] [PATCH] lavfi/removegrain: add x86 and x86_64 SSE2 functions

2015-07-11 Thread James Darnley
Speed of all modes increased by a factor between 7.4 and 19.8, largely depending on whether bytes are unpacked into words. Modes 2, 3, and 4 have been sped up by a factor of 43 (thanks quick sort!) All modes are available on x86_64 but only modes 1, 10, 11, 12, 13, 14, 19, 20, 21, and 22 are avail