Re: Using Lucene as a Document Comparison Tool

2019-12-13 Thread Michael Sokolov
Have you tried making a BooleanQuery with a term for every word in the query document as Optional? You will get a lot of matches, ranked according to the similarity. On Thu, Dec 12, 2019 at 10:47 AM John Brown wrote: > > Hi, > > > > I have some questions about how to use Lucene for the specific

Using Lucene as a Document Comparison Tool

2019-12-12 Thread John Brown
Hi, I have some questions about how to use Lucene for the specific purpose of finding document similarities. Lucene seems to have classes that were made for this, including: ClassicSimilarity and BM25Similarity. However I’m fumbling a bit when it comes to implementing them. >From what I under