Op Friday 05 September 2008 16:57:34 schreef Mark Miller: > Paul Elschot wrote: > > Op Thursday 04 September 2008 20:39:13 schreef Mark Miller: > >> Sounds like its more in line with what you are looking for. If I > >> remember correctly, the phrase query factors in the edit distance > >> in scoring, but the NearSpanQuery will just use the combined idf > >> for each of the terms in it, so distance shouldnt matter with > >> spans (I'm sure Paul will correct me if I am wrong). > > > > SpanScorer will use the similarity slop factor for each matching > > span size to adjust the effective frequency. > > The span size is the difference in position between the first > > and last matching term, and idf is not used for scoring Spans. > > The reason why idf is not used could be that there is no basic > > score value associated with inner spans; only top level spans > > are scored by SpanScorer. > > For more details, please consult the SpanScorer code. > > > > Regards, > > Paul Elschot > > Right, my fault, its the query normalization in the weight which uses > idf (by pulling from each clause in the span). So its kind of > factored into the score, but not in the way I implied. Sorry, my bad > on the info.
Well, I had missed the phrase idf over all the SpanQuery terms as used from the SpanWeight. Regards, Paul Elschot --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]