Le torstaina 15. kesäkuuta 2023, 17.58.37 EEST Arnie Chang a écrit : > Since these functions are frequently called, I prefer instantiating similar > code many times > rather than calling another internal function, as it may introduce > additional function call overhead.
This works both ways. Smaller code reduces IC overhead and the risk of its own eviction or that of some other frequently used code. Here, we would just add one `li` to the 8x cases, and a pair of `li` and `j` to the 2x and 4x cases (like we already do for Opus postfilter). Indeed, since this is assembler, we can enforce tail-call optimisation. Since this is assembler, you can count on tail-call optimisation. This is really just one `li` and `j` added on the 2 and 4. Not that I could measure the actual impact of either approaches. -- Rémi Denis-Courmont http://www.remlab.net/ _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email ffmpeg-devel-requ...@ffmpeg.org with subject "unsubscribe".