Re: [FFmpeg-devel] Prores encoding optmizations

2016-05-23 Thread compn
On Mon, 23 May 2016 14:30:54 +0200 Håvard Espeland wrote:
> Currently, we are also developing a version of the same encoder for
> Nvidia TX1 with Cuda/Neon SIMD for supporting multiple high quality
> streams in real time using ffmpeg. I guess there is little interest
> in merging this as well, bu

Re: [FFmpeg-devel] Prores encoding optmizations

2016-05-23 Thread Håvard Espeland
> Have you test your optimizations in the other prores encoder (prores
> kostya) (who i think have more features (interlaced encoding and 444
> versions)) ?

No, we haven’t done this yet. The goal for us was performance without losing accuracy. We chose Anatoliy simply because it was faster. I

Re: [FFmpeg-devel] Prores encoding optmizations

2016-05-23 Thread Håvard Espeland
Hi Paul,

> On 23 May 2016, at 13:13, Paul B Mahol wrote:
>
> On 5/23/16, Haavard Espeland wrote:
>> Hi guys,
>>
>> We have been working on Prores Anatoliy optimizations to get the speed up on
>> an embedded x86 platform. Fdct (10bit), scaling and encoding of code words
>> have been optimized w

Re: [FFmpeg-devel] Prores encoding optmizations

2016-05-23 Thread Martin Vignali
2016-05-23 13:44 GMT+02:00 Håvard Espeland:
> > The SIMD won't be accepted if it's intrinsics. The codeword encoding is not
> > SIMD, is it? So that may be worth upstreaming.
>
> All optimizations we’ve done are SIMD so it does not apply. Basically what
> we do for codewords is to process the

Re: [FFmpeg-devel] Prores encoding optmizations

2016-05-23 Thread Håvard Espeland
> The SIMD won't be accepted if it's intrinsics. The codeword encoding is not
> SIMD, is it? So that may be worth upstreaming.

All optimizations we’ve done are SIMD so it does not apply. Basically what we do for codewords is to process the shifting/masking for eight codewords at a time. The put
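The batching idea mentioned above (shifting/masking for eight codewords at a time) can be sketched in portable C. This is only an illustration under stated assumptions: the message is truncated and does not say which shifts and masks are involved, so the sketch uses the common signed-to-unsigned "zigzag" mapping that often precedes Golomb/Rice-style coding. The names `zigzag_map8` and `LANES` are hypothetical and are not FFmpeg identifiers, and the work described in the thread uses AVX2 rather than a plain loop.

```c
#include <stdint.h>

#define LANES 8  /* assumed batch width: eight values per call */

/* Hypothetical helper, not taken from FFmpeg: map eight signed values to
 * unsigned codes (0, -1, 1, -2, 2, ... -> 0, 1, 2, 3, 4, ...) using only
 * a shift and an XOR with a sign mask, with no data-dependent branches. */
static void zigzag_map8(const int32_t in[LANES], uint32_t out[LANES])
{
    for (int i = 0; i < LANES; i++) {
        uint32_t v    = (uint32_t)in[i];
        /* all-ones mask for negative inputs, zero otherwise */
        uint32_t sign = -(uint32_t)(in[i] < 0);
        out[i] = (v << 1) ^ sign;
    }
}
```

A fixed-width, branch-free loop like this is the kind of code a compiler can auto-vectorize, and eight 32-bit lanes correspond to one 256-bit register when the same operations are written by hand.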

Re: [FFmpeg-devel] Prores encoding optmizations

2016-05-23 Thread Paul B Mahol
On 5/23/16, Haavard Espeland wrote:
> Hi guys,
>
> We have been working on Prores Anatoliy optimizations to get the speed up on
> an embedded x86 platform. Fdct (10bit), scaling and encoding of code words
> have been optimized with AVX2 instructions, and the performance is increased
> by roughly 4

Re: [FFmpeg-devel] Prores encoding optmizations

2016-05-23 Thread Ronald S. Bultje
Hi Havard,

On Mon, May 23, 2016 at 6:36 AM, Håvard Espeland wrote:
> Hi guys,
>
> We have been working on Prores Anatoliy optimizations to get the speed up
> on an embedded x86 platform. Fdct (10bit), scaling and encoding of code
> words have been optimized with AVX2 instructions, and the perfor

[FFmpeg-devel] Prores encoding optmizations

2016-05-23 Thread Håvard Espeland
Hi guys,

We have been working on Prores Anatoliy optimizations to get the speed up on an embedded x86 platform. Fdct (10bit), scaling and encoding of code words have been optimized with AVX2 instructions, and the performance is increased by roughly 45% for the standard profile Prores 4:2:2 on o