2017-07-20 18:07 GMT+02:00 maxime taisant <maximetais...@hotmail.fr>:
> From: Maxime Taisant <maximetais...@hotmail.fr>
> I am currently working on SSE optimisations for the dwt functions used to
> decode JPEG2000.
> For the moment, I have only managed to produce a SSE-optimized version of the
> sr_1d97_float function
> (with relatively good results).

Please add some numbers.

> +    if (ARCH_X86)
> +        ff_sr_1d97_float_sse(line, mh, mh + lh);
> +    else
> +        sr_1d97_float(line, mh, mh + lh);

This is not correct, look at ff_jpeg2000dsp_init_x86() and
Jpeg2000DSPContext to see what has to be done.

Others will comment on the asm code.

Thank you, Carl Eugen
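
For illustration only, here is a minimal, self-contained sketch of the general pattern the review points to: the caller goes through a function pointer stored in a DSP context, and an init function picks the implementation from CPU flags detected at run time, instead of branching on ARCH_X86 at the call site. Every name below (DemoDWTDSPContext, demo_dwt_dsp_init, DEMO_CPU_FLAG_SSE, the demo_* functions) is hypothetical and is not FFmpeg's API; the function bodies are placeholders, not the 9/7 lifting steps. The actual layout should be taken from Jpeg2000DSPContext and ff_jpeg2000dsp_init_x86().

```c
#include <assert.h>

/* Hypothetical flag bit standing in for a run-time CPU feature flag. */
#define DEMO_CPU_FLAG_SSE 0x1

/* Hypothetical DSP context: one function pointer per optimizable routine. */
typedef struct DemoDWTDSPContext {
    void (*sr_1d97_float)(float *line, int i0, int i1);
} DemoDWTDSPContext;

/* Portable C version. Placeholder body: doubles each sample in [i0, i1).
 * This is NOT the real 9/7 synthesis filter. */
static void demo_sr_1d97_float_c(float *line, int i0, int i1)
{
    assert(i0 <= i1);
    for (int i = i0; i < i1; i++)
        line[i] *= 2.0f;
}

/* Stand-in for the SIMD version. In a real build this would be the
 * assembly routine; here it is plain C producing the same result. */
static void demo_sr_1d97_float_simd(float *line, int i0, int i1)
{
    assert(i0 <= i1);
    for (int i = i0; i < i1; i++)
        line[i] += line[i];
}

/* Init: default to the C version, then override when the CPU flags
 * passed in say the optimized version is usable. */
static void demo_dwt_dsp_init(DemoDWTDSPContext *c, int cpu_flags)
{
    c->sr_1d97_float = demo_sr_1d97_float_c;
    if (cpu_flags & DEMO_CPU_FLAG_SSE)
        c->sr_1d97_float = demo_sr_1d97_float_simd;
}
```

With a context initialised once, the call site in the patch would reduce to a single indirect call such as c->sr_1d97_float(line, mh, mh + lh), with no architecture check in the decoder itself.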