28.07.13 22:59, Roy Smith wrote:
The input is an 8.8 Mbyte file containing about 570,000 lines (11,000 unique strings).
Repeat your tests with totally unique lines.
The full profiler dump is at the end of this message, but the gist of it is:
The profiler affects execution time. In particular, it slows down the Counter implementation, which makes more function calls. For real-world measurements, use a different approach.
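For instance, something along these lines times both approaches with timeit instead of the profiler. This is only a rough sketch: make_lines() and the helper names are made up for illustration, the synthetic input merely mimics the sizes quoted above (570,000 lines, 11,000 unique strings), and I am assuming the hand-written alternative is a plain defaultdict loop, which may not match the original code.

```python
import timeit
from collections import Counter, defaultdict


def count_with_counter(lines):
    # Let collections.Counter do the counting.
    return Counter(lines)


def count_with_defaultdict(lines):
    # Assumed hand-written alternative: a plain loop over a defaultdict.
    counts = defaultdict(int)
    for line in lines:
        counts[line] += 1
    return counts


def make_lines(total, unique):
    # Hypothetical synthetic input: `total` lines cycling through `unique` distinct strings.
    return ["line %d" % (i % unique) for i in range(total)]


def best_time(func, lines, repeat=3):
    # Best wall-clock time of `repeat` single runs, with no profiler attached.
    return min(timeit.repeat(lambda: func(lines), number=1, repeat=repeat))


if __name__ == "__main__":
    total = 570000
    for label, unique in (("mostly repeated", 11000), ("totally unique", total)):
        lines = make_lines(total, unique)
        for func in (count_with_counter, count_with_defaultdict):
            seconds = best_time(func, lines)
            print("%-16s %-24s %.3f s" % (label, func.__name__, seconds))
```

Taking the minimum of several runs reduces noise from other processes, and running both data sets shows whether any difference depends on how many keys are unique.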
Why is count() [i.e. collections.Counter] so slow?
Feel free to contribute a patch which fixes this "wart". Note that Counter shouldn't be slowed down on mostly unique data.
-- http://mail.python.org/mailman/listinfo/python-list