Re: Consider only documents of a category for IDF

2010-10-18 Thread Max Jakob
erm) ? > > The returned figure includes deleted docs but then the search term uses this > method too so should suffer from the same inaccuracy. > > Cheers > Mark > > > > ----- Original Message > From: Max Jakob > To: java-user@lucene.apache.org > Sent: Mon, 18

Consider only documents of a category for IDF

2010-10-18 Thread Max Jakob
Hi, I would like to change the IDF value of the Lucene similarity computation to "inverse document frequency inside category". Not the complete collection should be considered, but only the documents that have a certain category. The categories are stored as separate fields. The implementation be