Re: [FFmpeg-devel] [PATCH v3] mdct15: add assembly optimizations for the 15-point FFT
On Fri, Jun 23, 2017 at 12:44 AM, Rostislav Pehlivanov wrote:
> +%macro FFT5 3 ; %1 - in_offset, %2 - dst1 (64bit used), %3 - dst2
> +movddup xm0, [inq + 0*16 + 0 + %1] ; in[ 0].re, in[ 0].im, in[ 0].re, in[ 0].im
> +movsd xm1, [inq + 1*16 + 8 + %1] ; in[ 3].re, in[ 3].im, 0
[FFmpeg-devel] [PATCH v3] mdct15: add assembly optimizations for the 15-point FFT
c:    1802 decicycles in fft15,16774635 runs, 2581 skips
fma3:  935 decicycles in fft15,16775893 runs, 1323 skips

Signed-off-by: Rostislav Pehlivanov
---
 libavcodec/mdct15.c | 182 +--
 libavcodec/mdct15.h | 26 +++
 libavcodec/x86