On Mon, Apr 09, 2018 at 03:00:17AM +0200, Michael Niedermayer wrote:
> On Sun, Apr 08, 2018 at 03:33:17PM +0200, Martin Vignali wrote:
> > Hello,
> >
> > The attached patches remove the mmxext versions of shufflebytes2103 and uyvyto422
> >
> > Checkasm tests show that the mmxext version is slower than the MMX version
> >
> > Also disable MMXext for the "internal" func: extract_even2
> > (only used by uyvyto422)
>
> are you testing these functions with real image data in a real image
> processing pipeline ?
>
> synthetic tests would likely not give correct results with non temporal
> stores.
> in a real processing pipeline the output image will generally not fit in
> the L1 cache and will generally not be reused before other things have
> overwritten the cache. So it should be better not to waste cache on it in
> the first place.
> But a synthetic test which just repeatedly filters a short line will
> behave very differently.
> It's thus expected that mmx2 performs poorly here in such a synthetic test
> but that doesn't say anything about how it performs in a real use case
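To make the cached vs. non temporal store distinction concrete, here is a minimal C sketch. It is not the swscale code: the function names are hypothetical, the 2,1,0,3 byte order is assumed from the function name and a little-endian host, and `_mm_stream_si32` (SSE2) only stands in for the non temporal stores of the mmxext version. Both variants produce the same output; they differ only in whether the destination is pulled into the cache.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#if defined(__SSE2__)
#include <emmintrin.h>   /* _mm_stream_si32, _mm_sfence */
#endif

/* Swap bytes 0 and 2 of a 32-bit word, keep bytes 1 and 3.
 * On a little-endian host this is the 2,1,0,3 byte order (assumed). */
static uint32_t swap_b0_b2(uint32_t v)
{
    return (v & 0xff00ff00u) | ((v & 0xffu) << 16) | ((v >> 16) & 0xffu);
}

/* Hypothetical variant using ordinary stores: dst lines end up in the cache. */
void shuffle_cached(const uint32_t *src, uint32_t *dst, size_t n)
{
    for (size_t i = 0; i < n; i++)
        dst[i] = swap_b0_b2(src[i]);
}

/* Hypothetical variant using non temporal stores where SSE2 is available:
 * the writes are hinted to bypass the cache. Falls back to ordinary stores
 * on other targets so the sketch still builds. */
void shuffle_streaming(const uint32_t *src, uint32_t *dst, size_t n)
{
#if defined(__SSE2__)
    for (size_t i = 0; i < n; i++)
        _mm_stream_si32((int *)&dst[i], (int)swap_b0_b2(src[i]));
    _mm_sfence();   /* make the streamed writes visible before returning */
#else
    shuffle_cached(src, dst, n);
#endif
}

/* Returns 1 if both variants write identical output for a test line. */
int outputs_match(void)
{
    enum { N = 1024 };
    static uint32_t src[N], a[N], b[N];

    for (size_t i = 0; i < N; i++)
        src[i] = (uint32_t)i * 2654435761u;   /* arbitrary test pattern */

    shuffle_cached(src, a, N);
    shuffle_streaming(src, b, N);

    assert(swap_b0_b2(0x11223344u) == 0x11443322u);
    return memcmp(a, b, sizeof a) == 0;
}
```

Timing these two on one short line in a loop keeps `dst` resident in L1 for the cached variant, which is the synthetic case described above; timing them on frame-sized buffers with other filters running in between is closer to a real pipeline.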

also IIRC all the really old SIMD code was benchmarked in real use cases
when it was written, so if a function that is favoured is slower, that's
"strange". And the question that should be asked is "why" it's slower now

[...]

--
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

It is dangerous to be right in matters on which the established authorities
are wrong. -- Voltaire
_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
http://ffmpeg.org/mailman/listinfo/ffmpeg-devel