Steve Holden wrote:
> This discussion seems to assume that Excel spreadsheets are stored in
> some canonical form so that two spreads with the same functionality are
> always identical on disk to the last bit. I very much doubt this is true
> (consider as an example the file properties that can be set).
>
> So really you need to define "equality". So far the tests discussed have
> concentrated on identifying identical files.
>
> regards
>  Steve

I was wondering myself whether the OP was actually interested in
binary-identical files or just in duplicated content. If it is just
duplicated content, perhaps this recipe could be used as a starting point:

http://aspn.activestate.com/ASPN/Cookbook/Python/Recipe/440661

The actual data can then be compared.

-Jay
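
To make the two notions of equality concrete, here is a minimal sketch. It is not taken from the recipe above, and it assumes the cell values have already been read out of each workbook into plain rows by whatever reader you use. The names `file_digest` and `content_digest`, and the shape of `rows`, are made up for illustration.

```python
import hashlib


def file_digest(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file's raw bytes.

    Two files match here only if they are identical to the last bit.
    """
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in chunks so large workbooks are not loaded all at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def content_digest(rows):
    """Return a SHA-256 hex digest of the cell values only.

    Assumption: `rows` is an iterable of rows, each an iterable of cell
    values already extracted from the workbook. File properties and
    other metadata never enter the digest.
    """
    digest = hashlib.sha256()
    for row in rows:
        # repr() keeps the number 1 and the text "1" distinct.
        digest.update(repr(tuple(row)).encode("utf-8"))
        digest.update(b"\n")
    return digest.hexdigest()
```

With something like this, two workbooks that differ only in their file properties would get different `file_digest` values but the same `content_digest`, provided the reader returns the same cell values for both. Whether formulas, formatting, or sheet names should count as "content" is a separate decision that this sketch does not make.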