Nadeem Vawda <nadeem.va...@gmail.com> added the comment:

> Also, I want to take a fresh look at the cache strategy (saving diffs
> of two files vs saving file contents individually) and think about
> whether that makes any sense at all for real world use cases
> (is there a common need to compare the same file pairs over and over
> again or is the typical use the comparison of many different file
> pairs). There may even be a better way to approach the underlying
> problem using hashes of entire files (md5, sha1, etc).

I like that idea. A hash-based approach could speed up the detection of
non-equal files quite a bit.

----------

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue11802>
_______________________________________