Re: Tuning MoreLikeThis scoring algorithm

2021-06-01 Thread TK Solr
Thank you for the information, Robert. The argument against the normalized score makes sense for the regular kind of search, where queries are much shorter than the documents. But MLT is a document vs document search. Can't we define 100% match as all terms are found in both documents at the same
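
To make the question above concrete, here is a minimal sketch of a document-vs-document measure that reaches 100% exactly when both documents contain the same set of terms. It uses plain Jaccard overlap of term sets, which is an assumption chosen for illustration and is not how Lucene's MoreLikeThis computes scores. The `terms` and `jaccard` helpers and the sample strings are hypothetical, and the regex tokenizer is only a stand-in for a real analyzer.

```python
import re


def terms(text):
    """Return the set of lowercase word tokens (stand-in for a real analyzer)."""
    return set(re.findall(r"\w+", text.lower()))


def jaccard(doc_a, doc_b):
    """Term-set overlap in [0, 1]; 1.0 means both documents share every term."""
    a, b = terms(doc_a), terms(doc_b)
    if not a and not b:
        # Two empty documents are treated as identical by convention.
        return 1.0
    return len(a & b) / len(a | b)


seed = "tuning the MoreLikeThis scoring algorithm"
print(jaccard(seed, seed))                  # same term set on both sides
print(jaccard(seed, "tuning the scoring"))  # partial overlap
```

A measure like this ignores term frequency and how rare each term is in the index, so it would rank candidates differently from a relevance score.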

Re: Tuning MoreLikeThis scoring algorithm

2021-05-28 Thread Robert Muir
See https://cwiki.apache.org/confluence/display/LUCENE/ScoresAsPercentages which has some broken nabble links, but is still valid. TLDR: Scoring just doesn't work the way you think. Don't try to interpret it as an absolute value; it is a relative one. On Fri, May 28, 2021 at 1:36 PM TK Solr wrot