Paul Eggert wrote:
> Pádraig Brady <[EMAIL PROTECTED]> writes:
>
>>There seems to be serious overhead with strcoll on glibc-2.3.5-10 at least.
>
> We can fix the performance problem for that particular test case as
> follows.  I installed this (diff -pubw format):
>
> 2006-08-14  Paul Eggert  <[EMAIL PROTECTED]>
>
>       * memcoll.c (memcoll): Optimize for the common case where the
>       arguments are bytewise equal.

Cool, that's a good optimization, loads of improvement with not much downside:

1 = coreutils 5.97 uniq
2 = coreutils 5.97 uniq with above patch
a = ascii long lines, with all same length (85 chars),
    and 26 identical lines for every 27
b = ascii long lines, with all same length (85 chars),
    and all adjacent lines different

LANG=en_IE.UTF8

 \      1       2
-------------------
a|   5.300   0.286
b|   0.438   0.449

Pádraig
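
For anyone reading this without the patch to hand, the idea in the ChangeLog entry can be sketched roughly as below. This is not the change installed in memcoll.c; the function name coll_with_fast_path and its signature are made up for illustration, and it assumes NUL-terminated inputs whose lengths are already known. The sketch checks for bytewise equality first and only falls back to strcoll when the buffers differ.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical helper, NOT the real memcoll(): compare two NUL-terminated
   strings of known length under the current LC_COLLATE locale.  */
static int
coll_with_fast_path (const char *s1, size_t s1len,
                     const char *s2, size_t s2len)
{
  /* Fast path: bytewise-equal arguments are reported as equal without
     calling strcoll at all.  */
  if (s1len == s2len && memcmp (s1, s2, s1len) == 0)
    return 0;

  /* Slow path: the buffers differ, so let the locale decide the order.  */
  return strcoll (s1, s2);
}
```

With input like case "a" above, most adjacent-line comparisons would take the memcmp branch, which fits the drop from 5.300 to 0.286. With input like case "b", every comparison still reaches strcoll after a failed memcmp, which fits the timing staying roughly the same.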