Re: Slow queries with lots of hits

Karl Wettin Thu, 04 Dec 2008 22:26:32 -0800

Hi Tim,

Is it possible that the slow queries contain terms that are very common in your index? If so, you could replace those clauses with a filter. This would affect the score, since filters do not contribute to scoring, but if your query contains enough other clauses that should not be a problem.

That is how I have managed to solve the problem you describe in several applications.
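To illustrate the idea, here is a minimal sketch in Python using a toy in-memory inverted index. It is not Lucene's API; the names `DOCS`, `build_postings` and `search` are made up for this example, and the 1/len weight is only a crude stand-in for real scoring. The point is that the very common term only restricts which documents may match, while scoring work is done only for the rarer terms.

```python
from collections import defaultdict

# Toy corpus (hypothetical data, for illustration only).
DOCS = {
    0: "the quick brown fox",
    1: "the lazy dog",
    2: "the quick dog",
    3: "a quick fox",
}


def build_postings(docs):
    """Map each term to the set of document ids that contain it."""
    postings = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.split():
            postings[term].add(doc_id)
    return postings


def search(postings, scored_terms, filter_terms=()):
    """Score only scored_terms; filter_terms restrict matches without scoring."""
    allowed = None  # None means "no filter applied"
    for term in filter_terms:
        ids = postings.get(term, set())
        allowed = ids if allowed is None else allowed & ids

    scores = defaultdict(float)
    for term in scored_terms:
        ids = postings.get(term, set())
        if not ids:
            continue
        weight = 1.0 / len(ids)  # crude assumption: rarer terms weigh more
        for doc_id in ids:
            if allowed is None or doc_id in allowed:
                scores[doc_id] += weight

    # Highest score first; ties broken by document id for a stable order.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


postings = build_postings(DOCS)
results = search(postings, scored_terms=["quick", "fox"], filter_terms=["the"])
```

Here "the" never adds to any score; it only removes document 3 from the candidates, so the ranking comes entirely from "quick" and "fox".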



     karl

On 4 Dec 2008, at 21:27, Tim Sturge wrote:

Hi all,

I have an interesting problem with my query traffic. Most of the queries run in a fairly short amount of time (< 100ms) but a few take over 1000ms. These queries are predominantly those with a huge number of hits (>1 million hits in a >100 million document index). The time taken (as far as I can tell) is for Lucene to sit there while it scores and sorts all these results.

However it turns out these queries really don’t have top results. That is, of the million documents, there are easily 10000 which are decent results (basically those above some threshold score). Frankly, just returning some consistent (so paging and reload work) but otherwise arbitrary ranking of these 10000 results would be more than good enough.

It seems to me that a solution would be to impose some sort of pseudo-random filter (e.g. consider only every n-th document assuming they are uniformly distributed). I’m wondering if anyone else has experience with this sort of issue and what solutions they have found to work well in practice.
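
A rough sketch of what such a pseudo-random filter could look like, written in Python rather than against Lucene itself. The function names and the `seed` parameter are hypothetical. It assumes each document has a stable identifier (an index's internal document numbers may change over time, so an application-level id is probably safer), and it hashes the id instead of taking every n-th id directly so that the sample does not depend on how ids were assigned.

```python
import zlib


def keep_document(doc_id, n, seed=0):
    """Keep roughly one document in n, chosen by a stable hash of its id."""
    key = f"{seed}:{doc_id}".encode("utf-8")
    # crc32 is deterministic across runs, unlike Python's built-in hash().
    return zlib.crc32(key) % n == 0


def sample_hits(hit_ids, n, seed=0):
    """Return the subset of hits that pass the filter, in original order."""
    return [doc_id for doc_id in hit_ids if keep_document(doc_id, n, seed)]


def page(hit_ids, n, page_number, page_size=10):
    """Return one page of the sampled hits; the same input gives the same page."""
    sampled = sample_hits(hit_ids, n)
    start = page_number * page_size
    return sampled[start:start + page_size]


hits = list(range(100_000))      # stand-in for a very large result set
sampled = sample_hits(hits, 100)  # about 1% of the hits survive
first_page = page(hits, 100, page_number=0)
```

Because the decision depends only on the id and the seed, the same documents survive on every request, which keeps paging and reloads consistent.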

Thanks,

Tim


