Hi Karl!
I'm interested in near duplicate detection based on termFreqVectos. Now I'm comparing all documents with each other (calculating the angle)... Is there a way to avoid that?

Thanks!
Beto

karl wettin wrote:

17 okt 2006 kl. 17.54 skrev Find Me:

How to eliminate near duplicates from the index?

Oh, one more thing. You should probably look at the norms in order to avoid comparing all documents to each other.



------------------------------------------------------------------------

No virus found in this incoming message.
Checked by AVG Free Edition.
Version: 7.1.408 / Virus Database: 268.13.4/477 - Release Date: 10/16/2006

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to