Scoring Across Multiple Fields

2020-01-27 Thread John Brown
Hi, I have a question regarding how Lucene computes document similarities from field similarities. Lucene's scoring documentation mentions that scoring works on fields and combines the results to return documents. I'm assuming fields are given scores, and those scores are simply averaged to retur
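
As a rough illustration of the question above: Lucene's BooleanQuery sums the scores of its matching clauses rather than averaging them, and DisjunctionMaxQuery takes the best clause score plus a tie-breaker share of the rest. The sketch below is plain Python, not Lucene's API. The field names, the per-field scores, and the function names are all made up for illustration; real per-field scores would come from the configured Similarity.

```python
# Hypothetical per-field scores for one document. In Lucene these would be
# produced by the Similarity in use; the numbers here are invented.
field_scores = {"title": 2.0, "body": 0.5}


def combine_sum(scores):
    """Sum of matching clause scores, the way BooleanQuery combines them."""
    return sum(scores.values())


def combine_average(scores):
    """The averaging assumed in the question, shown only for comparison."""
    return sum(scores.values()) / len(scores)


def combine_dismax(scores, tie_breaker=0.0):
    """Best score plus tie_breaker times the others (DisjunctionMaxQuery style)."""
    values = list(scores.values())
    best = max(values)
    return best + tie_breaker * (sum(values) - best)


if __name__ == "__main__":
    print("sum:    ", combine_sum(field_scores))
    print("average:", combine_average(field_scores))
    print("dismax: ", combine_dismax(field_scores, tie_breaker=0.1))
```

One consequence of summing is that a document matching in more fields tends to score higher, whereas an average would not reward extra matching fields in the same way.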

Using Lucene as a Document Comparison Tool

2019-12-12 Thread John Brown
Hi, I have some questions about how to use Lucene for the specific purpose of finding document similarities. Lucene seems to have classes that were made for this, including ClassicSimilarity and BM25Similarity. However, I’m fumbling a bit when it comes to implementing them. From what I under
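
One way to picture document comparison with a BM25-style similarity is to treat one document's terms as the query and score the other documents against it. The sketch below is plain Python using the textbook Okapi BM25 formula with the defaults k1=1.2 and b=0.75; it is not Lucene's implementation, which differs in details such as how document lengths are stored. The function name and the tiny corpus are assumptions made up for illustration.

```python
import math
from collections import Counter


def bm25_score(query_terms, doc_terms, corpus, k1=1.2, b=0.75):
    """Score doc_terms against query_terms with textbook BM25.

    corpus is a list of tokenized documents, used for document
    frequencies and the average document length.
    """
    n_docs = len(corpus)
    avg_len = sum(len(d) for d in corpus) / n_docs
    tf = Counter(doc_terms)
    score = 0.0
    for term in set(query_terms):
        df = sum(1 for d in corpus if term in d)
        if df == 0 or tf[term] == 0:
            continue  # term is absent, so it contributes nothing
        idf = math.log(1 + (n_docs - df + 0.5) / (df + 0.5))
        norm = k1 * (1 - b + b * len(doc_terms) / avg_len)
        score += idf * tf[term] * (k1 + 1) / (tf[term] + norm)
    return score


# Invented three-document corpus, tokenized on whitespace.
corpus = [
    "lucene scores documents by field".split(),
    "lucene ranks documents with bm25".split(),
    "cats sleep most of the day".split(),
]

if __name__ == "__main__":
    source = corpus[0]
    for other in corpus[1:]:
        print(" ".join(other), "->", bm25_score(source, other, corpus))
```

Note that a score computed this way is not symmetric: scoring document A against B generally gives a different number than scoring B against A, which matters if a true similarity measure is needed.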