I tried to override the default lengthNorm method with the suggestion in
this link
https://issues.apache.org/jira/browse/LUCENE-2187.
However, it does not work, because not every number of terms from 1 to 10
gets a unique score.
Here is my solution, which only works for shorter fields. Welcome any
crit
Ahmet and Ian:
Thanks to both of you very much. I will try the patch.
Qi
On Wed, Dec 29, 2010 at 9:00 AM, Ahmet Arslan wrote:
> Test case
> doc1 : test -- one two
> three
> doc2 : test, one two three
> doc3 : one two three
>
> Search query : "one two three" by QueryParser and
> StandardAnalyzer
>
> Question: why do all three documents have the same
> score?
As Ian said, length norm values of your a
Some of the factors that go into the score calculation are encoded as
a single byte, with an inevitable loss of precision. Length may be one
of these, in which case Lucene is not differentiating between your 3-
and 4-word docs. Try indexing a document that is significantly longer
than 3 or 4 words.
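
To make the precision loss concrete, here is a minimal sketch in plain Java. It assumes the default length norm is 1/sqrt(number of terms) and that the norm is stored in one byte using the scheme of Lucene's SmallFloat.floatToByte315; both are assumptions about the Lucene version in use. The class name LengthNormDemo and the helpers lengthNorm and storedNorm are made up for illustration, and the two conversion methods are re-implemented here so the example runs without Lucene on the classpath.

```java
// Sketch only: models the one-byte norm encoding in plain Java.
// LengthNormDemo, lengthNorm and storedNorm are hypothetical names.
class LengthNormDemo {

    // Assumed default length norm: 1 / sqrt(number of terms in the field).
    static float lengthNorm(int numTerms) {
        return (float) (1.0 / Math.sqrt(numTerms));
    }

    // Modelled on SmallFloat.floatToByte315: keeps only the top bits of the float.
    static byte floatToByte315(float f) {
        int bits = Float.floatToRawIntBits(f);
        int smallfloat = bits >> (24 - 3);
        if (smallfloat <= ((63 - 15) << 3)) {
            // Zero or negative maps to 0; tiny positive values map to 1.
            return (bits <= 0) ? (byte) 0 : (byte) 1;
        }
        if (smallfloat >= ((63 - 15) << 3) + 0x100) {
            return -1; // overflow: largest representable value
        }
        return (byte) (smallfloat - ((63 - 15) << 3));
    }

    // Inverse of floatToByte315: rebuilds a float from the stored byte.
    static float byte315ToFloat(byte b) {
        if (b == 0) {
            return 0.0f;
        }
        int bits = (b & 0xff) << (24 - 3);
        bits += (63 - 15) << 24;
        return Float.intBitsToFloat(bits);
    }

    // The norm as it would be seen at search time: encoded, then decoded.
    static float storedNorm(int numTerms) {
        return byte315ToFloat(floatToByte315(lengthNorm(numTerms)));
    }

    public static void main(String[] args) {
        for (int n = 1; n <= 10; n++) {
            System.out.printf("terms=%d exact=%.4f stored=%.4f%n",
                    n, lengthNorm(n), storedNorm(n));
        }
    }
}
```

Under these assumptions, fields with 3 and 4 terms both come back with a stored norm of 0.5, which would explain identical scores for the test documents.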
Further reading:
Happy Holidays !
Test case
doc1 : test -- one two three
doc2 : test, one two three
doc3 : one two three
Search query : "one two three" by QueryParser and StandardAnalyzer
Question: why do all three documents have the same score? I really want
doc3 to have a higher score becaus