Re: How do I implement "find documents like document x."

2005-09-19 Thread Grant Ingersoll
I believe there a several ways of doing it. You can use the MoreLikeThis contribution at http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/similarity or you can roll your own using the TermVector implementation. Basically, do your first search, get the term vector from the document you ar

How do I implement "find documents like document x."

2005-09-19 Thread Peter Gelderbloem
Hi I was wondering how would you search for documents similar to a specified document using Lucene? The context would be that I categorise document A manually, and then search for documents with similar terms. Hopefully the documents returned would be in the same category/theme as document A. The