Re: Help: tweaking search - reducing IDF skew and implementing score cutoff

2006-02-10 Thread Chris Lamprecht
> 2. If I choose to sort the results by date, then recent documents with
> very very low relevancy (say the words searched appears only in
> content, and not in title/bylines/summary fields that are boosted
> higher) are still shown relatively high in the list, and I wish to
> omit them in general.
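
One way to picture the score cutoff asked about here is to drop low-scoring hits first and only then sort what remains by date. The sketch below is plain Python, not Lucene code; the function name `cutoff_then_sort_by_date`, the `min_fraction` threshold of 0.2, and the sample hits are all hypothetical and only illustrate the idea.

```python
from datetime import date

def cutoff_then_sort_by_date(hits, min_fraction=0.2):
    """Keep hits scoring at least min_fraction of the best score, newest first.

    Each hit is a (doc_id, score, posted_date) tuple. The threshold is relative
    to the top score because raw scores are not comparable across queries.
    """
    if not hits:
        return []
    top_score = max(score for _, score, _ in hits)
    kept = [hit for hit in hits if hit[1] >= min_fraction * top_score]
    # Sort the surviving hits by date, most recent first.
    return sorted(kept, key=lambda hit: hit[2], reverse=True)

# Hypothetical hits: the newest document matches only in the content field.
hits = [
    ("title-match", 0.90, date(2006, 2, 1)),
    ("content-only", 0.05, date(2006, 2, 9)),
    ("summary-match", 0.40, date(2006, 2, 5)),
]
result = cutoff_then_sort_by_date(hits)
print([doc_id for doc_id, _, _ in result])
```

A relative threshold is a judgment call: it removes the very weak content-only match in this example, but a suitable value would have to be tuned against real queries.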

Re: Help: tweaking search - reducing IDF skew and implementing score cutoff

2006-02-09 Thread Chris Hostetter
: Sunday gets ranked highly due to idf. How do I reduce this skewness
: due to the date-posted field? I saw a reference earlier to
: ConstantScoreRangeQuery on JIRA - is it the solution?

Yes. RangeQuery expands to a BooleanQuery containing all of the terms in the range. The number of terms (and the fr
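
To see why a date term with few documents scores higher, here is a rough sketch in plain Python (not Lucene code). It assumes the classic idf formula used by Lucene's DefaultSimilarity, `1 + ln(numDocs / (docFreq + 1))`; the per-day document counts are made up, and the constant contribution of 1.0 only stands in for the idea behind a constant-score range query, where every document in the range gets the same contribution from the date clause.

```python
import math

def classic_idf(doc_freq, num_docs):
    """Classic idf: rarer terms (lower doc_freq) get a larger weight."""
    return 1.0 + math.log(num_docs / (doc_freq + 1))

num_docs = 10_000  # hypothetical index size

# Hypothetical number of articles per date term; 20060205 is a Sunday
# with far fewer articles than the two days before it.
docs_per_day = {"20060203": 400, "20060204": 350, "20060205": 20}

# A range query expanded into one term per date weights each date by its idf.
idf_by_day = {day: classic_idf(n, num_docs) for day, n in docs_per_day.items()}

# With a constant-score approach, every date in the range contributes equally.
constant_by_day = {day: 1.0 for day in docs_per_day}

for day in docs_per_day:
    print(day, round(idf_by_day[day], 3), constant_by_day[day])
```

The quiet Sunday ends up with the largest idf purely because few articles were posted that day, which is the skew described in the question.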

Help: tweaking search - reducing IDF skew and implementing score cutoff

2006-02-09 Thread Chun Wei Ho
Hi, I am running a search for something akin to a news site, where each news document has date, title, keywords/bylines, and summary fields, followed by the actual content. Using Lucene for this database of documents, it seems that: 1. The relevancy score is skewed drastically by the actual number of ne