> > I tried your code (with one change, time on feedback lines) and got the > > same terrible > > performance against my data set. > > > > To prove that my machine is sane, I ran the same against your generated >> sample file and got _excellent_ performance. Start to finish in under a minute. > > One possibility could be that your dataset turns out to be some sort > of pathological worst case for the hashing algorithm in python.
Cool! Putting that on the resume. ;) -- http://mail.python.org/mailman/listinfo/python-list