On Friday 11 December 2009 18:36:33 Noah Pugsley wrote: > STeve Andre' wrote: > > I am wondering if there is a port or otherwise available > > code which is good at comparing large numbers of files in > > an arbitrary number of directories? I always try avoid > > wheel re-creation when possible. I'm trying to help some- > > one with large piles of data, most of which is identical > > across N directories. Most. Its the 'across dirs' part > > that involves the effort, hence my avoidance of thinking > > on it if I can help it. ;-) > > > > Thanks, STeve Andre' > > > > Compare how?
I should have been more clear I suppose. I'd like to know the files that are identical, files that are of the same name but different across directories, possibly several directories. What I have is a large clump of data in the form of some huge number of reletively small files, which were extracted out of a database as individual files. I am not responsible for this(!) but am trying to come up with a reasonable way of spotting duplicates, etc. Some files have the same name (and even some with the same size) but are different. It's a mess, but the original database died and all I have are peices, kind of like shards from a large piece of pottery that just got smashed. I'm not even sure what all the data looks like at this point--I can only assume its going to be ugly, no thought about this when the files were created. --STeve Andre'