Hi, On Mon, 28 Jul 2008, Agner Fog wrote:
> Glibc 2.8 is still almost 5 times slower than the best function > libraries for unaligned data on Intel Core 2, and the default builtin > function is slower than any other implementation I have seen (copies 1 > byte at a time!). You must be doing something wrong. If the compiler decides to inline the string ops it either knows the size or you told it to do it anyway (-minline-all-stringops or -minline-stringops-dynamically). In both cases will it use wider than byte moves when possible. Ciao, Michael.