Hi Simon,
> that is what my first guess was and I'm pretty sure that the long time
> is taken before the documents get scored. A short prefix can easily
> expand to thousands of terms, do you encounter
> TooManyClausesExceptions and in turn do you set
> BooleanQuery#setMaxClauseCount() to a higher
t
> BooleanQuery#setMaxClauseCount() to a higher value than 1024?
> I wonder if BooleanQuery#setAllowDocsOutOfOrder(true) would give you
> any performance hit if you don't care about the order of how the docs
> come in. Any idea how many terms your prefix query expands to?
>
> sim
iginal message -----
> From: simon.willnauer [at] googlemail
> Re: Using HitCollector to Collect First N Hits
>
> Hi Len,
> what kind of query do you execute when you collect the hits.
> HitCollector should be called for each document by the time it is
> scored. Is it possible that you r
Using HitCollector to Collect First N Hits
Hi Len,
what kind of query do you execute when you collect the hits.
HitCollector should be called for each document by the time it is
scored. Is it possible that you run a query that could be expensive in
terms of term expansion like WildcardQuery?
Hi Len,
what kind of query do you execute when you collect the hits.
HitCollector should be called for each document by the time it is
scored. Is it possible that you run a query that could be expensive in
terms of term expansion like WildcardQuery?
simon
On Sat, Aug 22, 2009 at 7:09 AM, Len Tak
arch.
Regards,
Rafis
--
View this message in context:
http://www.nabble.com/Using-HitCollector-to-Collect-First-N-Hits-tp25090722p25093309.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
-
To unsubs
> I’m using Lucene 2.4.1 and I’m trying to use a custom
> HitCollector to collect only the first N hits (not the best hits) for
> performance.
You mean that you do not need score calculation therefore you do not want
results sorted by relevancy. Just you need is a Boolean Retrieval Model, right?
Hello,
Im using Lucene 2.4.1 and Im trying to use a custom HitCollector to collect
only the first N hits (not the best hits) for performance. I saw another
e-mail in this group where they mentioned writing a HitCollector which throws
an exception after N hits to do this. So I tried this approa