Hello,

Attached is a new patch adding SSE SIMD for reorder pixels in the EXR decoder (used by ZIP and RLE decompression), following the comments on the previous patch by Ivan Kalvachev.
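For readers without the patch at hand, here is a minimal sketch of the idea in C. It is not the code from the attached patch. It assumes the reorder step interleaves bytes from the first and second halves of the source buffer, and it uses SSE2 intrinsics with a scalar tail so that no load or store goes past the end of either buffer. The names reorder_pixels_scalar, reorder_pixels_sse2 and reorder_matches are hypothetical, chosen only for this example.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>
#if defined(__SSE2__)
#include <emmintrin.h>
#endif

/* Scalar reference (hypothetical name): dst[0] = first half[0],
 * dst[1] = second half[0], dst[2] = first half[1], and so on.
 * For odd sizes the first half holds the extra byte. */
static void reorder_pixels_scalar(uint8_t *dst, const uint8_t *src, size_t size)
{
    const uint8_t *t1 = src;
    const uint8_t *t2 = src + (size + 1) / 2;
    size_t k = 0;

    while (k < size) {
        dst[k++] = *t1++;
        if (k < size)
            dst[k++] = *t2++;
    }
}

/* SIMD sketch (hypothetical name): 16 bytes from each half per iteration,
 * 32 bytes written. Falls back to the scalar tail alone without SSE2. */
static void reorder_pixels_sse2(uint8_t *dst, const uint8_t *src, size_t size)
{
    const uint8_t *t1 = src;
    const uint8_t *t2 = src + (size + 1) / 2;
    size_t i = 0; /* index into each half */

#if defined(__SSE2__)
    /* The second half has size / 2 bytes, so this bound keeps every
     * 16-byte load inside it and every 32-byte store inside dst. */
    for (; i + 16 <= size / 2; i += 16) {
        __m128i a = _mm_loadu_si128((const __m128i *)(t1 + i));
        __m128i b = _mm_loadu_si128((const __m128i *)(t2 + i));
        _mm_storeu_si128((__m128i *)(dst + 2 * i),      _mm_unpacklo_epi8(a, b));
        _mm_storeu_si128((__m128i *)(dst + 2 * i + 16), _mm_unpackhi_epi8(a, b));
    }
#endif
    /* Scalar tail: finishes the remaining bytes one at a time,
     * so nothing is read or written past the end. */
    for (size_t k = 2 * i; k < size; k++)
        dst[k] = (k & 1) ? t2[k / 2] : t1[k / 2];
}

/* Hypothetical helper: returns 1 if both versions agree for this size. */
static int reorder_matches(size_t size)
{
    uint8_t src[256], want[256], got[256];

    assert(size <= sizeof(src));
    for (size_t k = 0; k < size; k++)
        src[k] = (uint8_t)(k * 7 + 3);
    reorder_pixels_scalar(want, src, size);
    reorder_pixels_sse2(got, src, size);
    return memcmp(want, got, size) == 0;
}
```

Comparing the two versions over many sizes, including ones that are not a multiple of 32, is one way to catch a problem in the last loop iteration that a single small test buffer could miss.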
After testing with only a small buffer, I fixed the overread problem of the previous patch (which had hidden the problem in the last loop).

It passes the FATE tests for me (on Mac OS X).

Tested by decoding a sequence of 150 HD EXR images (CGI renders with 17 layers per file, float pixels, ZIP16 compression).

SSE:
349190 decicycles in reorder_pixels_zip, 130716 runs, 356 skips
bench: utime=109.222s
bench: maxrss=607002624kB

Scalar:
3039686 decicycles in reorder_pixels_zip, 130395 runs, 677 skips
bench: utime=123.042s
bench: maxrss=607019008kB

That is roughly 8.7x fewer decicycles in reorder_pixels_zip, and about 11% less user time for the whole decode.

Comments welcome.

Martin
Jokyo Images
0001-libavcodec-add-SSE-SIMD-for-reorder-pixels.patch
_______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org http://ffmpeg.org/mailman/listinfo/ffmpeg-devel