Yonik Seeley wrote:
idf is dependent only on the corpus, not on the individual document.
The formula is here:
http://lucene.apache.org/java/docs/api/org/apache/lucene/search/Similarity.html
1+log(1/2) = 0.30685282
Thanks Yonik, whilst all is not yet completely clear, it is much more so!
Ant
On 11/7/06, Antony Bowesman <[EMAIL PROTECTED]> wrote:
I've been trying to understand how idf is arrived at from a query. I have a
single Document with 9 fields. One field "subject" has the phrase "RFC2822 -
Internet Message Format" and a second "body" has the contents of rfc2822.
The other fi
I've been trying to understand how idf is arrived at from a query. I have a
single Document with 9 fields. One field "subject" has the phrase "RFC2822 -
Internet Message Format" and a second "body" has the contents of rfc2822.
The other fields contain additional meta data. If I search for su