Kåre Krig <karek...@gmail.com> added the comment:

I tried it again with another file. This time I used the dictionary from 
www.math.sjsu.edu/~foster/dictionary.txt  (~3Mb)

hash(buff_A) == hash(buff_B)  returns False just like the direct comparison. I 
ran the program on dictionary.txt and printed buff_A & buff_B to two different 
files. When running diff on those files the reported differences where:

149668c149668
< intraisland
---
> intrqisland
150052c150052
< invernacular
---
> ynvernacular
230933c230933
< perwitsky
---
> perwitski


For my first run, then immediatly running the same script and doing diff again 
produced another set of differences

253803c253803
< recrown
---
> recrow~
254213c254213
< redisseise
---
> bedisseise
254656c254656
< reflectors
---
> beflectors
255083c255083
< regrating
---
> regratinw


Note how the ascii codes for the faulty characters only differ by one bit, and 
only the 5th least significant bit. This is consistent with my previous tests.

----------

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue12784>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

Reply via email to