(13/07/11 22:56), gtkesh wrote:
Hi everyone! I have two questions:
1. What are the cases where Lucene's default tf-idf overperforms BM25? What
are the best use cases where I should use tf-idf or BM25?
2. Are there any user-friendly guide or something about how can I use BM25
algorithm instead of Lucene's default tf-idf? I tried to search but couldn't
find anything useful.
For #2, you may find that the following is useful...
http://lucene.apache.org/core/4_3_1/core/org/apache/lucene/search/similarities/package-summary.html#changingSimilarity
koji
--
http://soleami.com/blog/automatically-acquiring-synonym-knowledge-from-wikipedia.html
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org