Re: idf calculation in Lucene ?

2011-11-02 Thread David Ryan
ion, > org.apache.lucene.search.Searcher) > > On Mon, Oct 31, 2011 at 5:24 PM, David Ryan wrote: > > Thanks! Is there any way to extend the Similarity class to overwrite the > > behavior (e.g., using the max idf instead of the sum of each term idfs)? > > > > > > On Thu, Oct 27, 2011 at 5:

Re: idf calculation in Lucene ?

2011-10-31 Thread David Ryan
Thanks! Is there any way to extend the Similarity class to overwrite the behavior (e.g., using the max idf instead of the sum of each term idfs)? On Thu, Oct 27, 2011 at 5:41 AM, Robert Muir wrote: > On Thu, Oct 20, 2011 at 3:11 PM, David Ryan wrote: > > > > > However, in

idf calculation in Lucene ?

2011-10-20 Thread David Ryan
According to https://lucene.apache.org/java/3_4_0/api/core/org/apache/lucene/search/Similarity.html idf(t) = 1 + log ( numDocs/(docFreq+1)) For example, in the following example, ln(26

Scoring in Lucene

2011-10-05 Thread David Ryan
Hi, The defaulting scoring in Lucene uses tf x idf^2 instead of tf x idf . Does any have have insight that why not using tf x idf? Here is the note on score calculation. https://lucene.apache.org/java/3_4_0/api/core/org/apache/lucene/search/Similarity.html