On Friday 11 December 2009 18:36:33 Noah Pugsley wrote:
> STeve Andre' wrote:
> >    I am wondering if there is a port or otherwise available
> > code which is good at comparing large numbers of files in
> > an arbitrary number of directories?  I always try to avoid
> > wheel re-creation when possible.  I'm trying to help someone
> > with large piles of data, most of which is identical
> > across N directories.  Most.  It's the 'across dirs' part
> > that involves the effort, hence my avoidance of thinking
> > on it if I can help it. ;-)
> > 
> > Thanks, STeve Andre'
> > 
> 
> Compare how?

I should have been clearer, I suppose.  I'd like to know
which files are identical, and which files have the same
name but different contents across directories, possibly
several directories.

What I have is a large clump of data in the form of some
huge number of relatively small files, which were extracted
out of a database as individual files.  I am not responsible
for this(!) but am trying to come up with a reasonable way
of spotting duplicates, etc.  Some files have the same
name (and some even have the same size) but are different.
It's a mess, but the original database died and all I have
are pieces, kind of like shards from a large piece of pottery
that just got smashed.  I'm not even sure what all the data
looks like at this point--I can only assume it's going to be
ugly, since no thought was given to this when the files were
created.
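
To make the goal concrete, here is a minimal sketch of the kind of
survey described above, assuming Python is available and that
"identical" means byte-for-byte identical content.  The names
file_digest and survey are made up for illustration; this is one
possible approach, not an existing port or tool.  It hashes every
file, then reports (a) groups of files with the same content and
(b) file names that appear with more than one distinct content.

```python
import hashlib
import os
import sys
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def survey(roots):
    """Walk the given directories and group files by content and by name.

    Returns (duplicates, conflicts):
      duplicates: digest -> sorted paths sharing that content (2+ paths)
      conflicts:  file name -> {digest: sorted paths}, only for names
                  that occur with 2+ distinct contents
    """
    by_digest = defaultdict(list)
    by_name = defaultdict(lambda: defaultdict(list))
    for root in roots:
        for dirpath, _dirnames, filenames in os.walk(root):
            for name in filenames:
                path = os.path.join(dirpath, name)
                if not os.path.isfile(path):
                    continue  # skip broken symlinks and other non-files
                digest = file_digest(path)
                by_digest[digest].append(path)
                by_name[name][digest].append(path)

    duplicates = {d: sorted(p) for d, p in by_digest.items() if len(p) > 1}
    conflicts = {
        name: {d: sorted(p) for d, p in groups.items()}
        for name, groups in by_name.items()
        if len(groups) > 1
    }
    return duplicates, conflicts


if __name__ == "__main__":
    duplicates, conflicts = survey(sys.argv[1:])
    for digest, paths in duplicates.items():
        print("identical:", " ".join(paths))
    for name, groups in conflicts.items():
        print("same name, different content:", name)
        for digest, paths in groups.items():
            print("   ", digest[:12], " ".join(paths))
```

Saved under a hypothetical name such as survey.py, it would be run
as "python survey.py dir1 dir2 ... dirN".  Hashing every file is the
simple choice; with a very large pile of files, grouping by size
first and hashing only files whose sizes collide would save time,
since files of different sizes cannot be identical.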

--STeve Andre'
