Re: Efficiently determine where documents differ

2010-01-05 Thread Richard
On Jan 5, 9:46 am, "Gabriel Genellina" wrote:
> On Mon, 04 Jan 2010 19:04:12 -0300, Richard wrote:
>
> > I have been using the difflib library to find where 2 large HTML
> > documents differ. The Differ().compare() method does this, but it is
> > very slow - at least 100x slower than the unix d

Re: Efficiently determine where documents differ

2010-01-04 Thread Gabriel Genellina
On Mon, 04 Jan 2010 19:04:12 -0300, Richard wrote:

> I have been using the difflib library to find where 2 large HTML documents differ. The Differ().compare() method does this, but it is very slow - at least 100x slower than the unix diff command.

Differ compares sequences of lines *and* line
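
For anyone reading this thread later, here is a minimal sketch of a cheaper line-level comparison. It is an illustration, not necessarily what was proposed in the reply above. It assumes that line granularity is enough and that both documents fit in memory. The helper name changed_line_ranges is hypothetical. As the difflib documentation describes, Differ also compares characters inside near-matching lines; running SequenceMatcher directly on the lists of lines skips that step, which may help on large inputs.

```python
import difflib


def changed_line_ranges(old_text, new_text):
    """Return the non-equal opcodes between two documents, compared line by line.

    Each item is (tag, old_start, old_end, new_start, new_end), where tag is
    'replace', 'delete' or 'insert' and the indices are 0-based line numbers
    (end index exclusive).
    """
    old_lines = old_text.splitlines()
    new_lines = new_text.splitlines()
    # autojunk=False turns off the "popular element" heuristic, which can
    # otherwise treat frequently repeated lines as junk in long documents.
    matcher = difflib.SequenceMatcher(None, old_lines, new_lines, autojunk=False)
    return [op for op in matcher.get_opcodes() if op[0] != "equal"]


if __name__ == "__main__":
    old = "<html>\n<p>one</p>\n<p>two</p>\n</html>\n"
    new = "<html>\n<p>one</p>\n<p>2</p>\n<p>three</p>\n</html>\n"
    for tag, i1, i2, j1, j2 in changed_line_ranges(old, new):
        print(tag, "old lines", i1, "to", i2, "/ new lines", j1, "to", j2)
```

If character offsets are needed rather than line numbers, the line indices can be mapped back to offsets by summing the lengths of the preceding lines. Whether this gets close to the speed of the unix diff command depends on the documents, so it is worth timing on real data.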

Efficiently determine where documents differ

2010-01-04 Thread Richard
Hello, I have been using the difflib library to find where 2 large HTML documents differ. The Differ().compare() method does this, but it is very slow - at least 100x slower than the unix diff command. How can I efficiently determine where 2 documents differ in Python? (Ideally I am after the posi