RE: Using HitCollector to Collect First N Hits

2009-08-24 Thread Len Takeuchi
Hi Simon, > that is what my first guess was and I'm pretty sure that the long time > is taken before the documents get scored. A short prefix can easily > expand to thousands of terms, do you encounter > TooManyClausesExceptions and in turn do you set > BooleanQuery#setMaxClauseCount() to a higher

Re: Using HitCollector to Collect First N Hits

2009-08-22 Thread Simon Willnauer
t > BooleanQuery#setMaxClauseCount() to a higher value than 1024? > I wonder if BooleanQuery#setAllowDocsOutOfOrder(true) would give you > any performance hit if you don't care about the order of how the docs > come in. Any idea how many terms your prefix query expands to? > > sim

Re: Using HitCollector to Collect First N Hits

2009-08-22 Thread Simon Willnauer
iginal message ----- > From: simon.willnauer [at] googlemail > Re: Using HitCollector to Collect First N Hits > > Hi Len, > what kind of query do you execute when you collect the hits. > HitCollector should be called for each document by the time it is > scored. Is it possible that you r

Re: Using HitCollector to Collect First N Hits

2009-08-22 Thread Len Takeuchi
Using HitCollector to Collect First N Hits Hi Len, what kind of query do you execute when you collect the hits. HitCollector should be called for each document by the time it is scored. Is it possible that you run a query that could be expensive in terms of term expansion like WildcardQuery?

Re: Using HitCollector to Collect First N Hits

2009-08-22 Thread Simon Willnauer
Hi Len, what kind of query do you execute when you collect the hits. HitCollector should be called for each document by the time it is scored. Is it possible that you run a query that could be expensive in terms of term expansion like WildcardQuery? simon On Sat, Aug 22, 2009 at 7:09 AM, Len Tak

Re: Using HitCollector to Collect First N Hits

2009-08-22 Thread Rafis
arch. Regards, Rafis -- View this message in context: http://www.nabble.com/Using-HitCollector-to-Collect-First-N-Hits-tp25090722p25093309.html

Re: Using HitCollector to Collect First N Hits

2009-08-22 Thread AHMET ARSLAN
> I’m using Lucene 2.4.1 and I’m trying to use a custom > HitCollector to collect only the first N hits (not the best hits) for > performance. You mean that you do not need score calculation and therefore do not want the results sorted by relevance. All you need is a Boolean retrieval model, right?

Using HitCollector to Collect First N Hits

2009-08-21 Thread Len Takeuchi
Hello, I’m using Lucene 2.4.1 and I’m trying to use a custom HitCollector to collect only the first N hits (not the best hits) for performance. I saw another e-mail in this group where they mentioned writing a HitCollector which throws an exception after N hits to do this. So I tried this approa
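
For readers following the thread, the "throw an exception after N hits" approach mentioned above can be sketched roughly as follows. This is only an illustration under stated assumptions, not Len's actual code: SimpleHitCollector is a hypothetical stand-in for Lucene 2.4's HitCollector (which exposes an abstract collect(int doc, float score) method), and search() here is a toy loop over doc ids rather than a real IndexSearcher, so that the sketch runs without Lucene on the classpath. EnoughHitsException, FirstNCollector and FirstNHits are likewise made-up names.

```java
import java.util.ArrayList;
import java.util.List;

class FirstNHits {
    // Toy replacement for a searcher: feeds matching doc ids to the collector in index order.
    static void search(int[] matchingDocs, SimpleHitCollector collector) {
        for (int doc : matchingDocs) {
            collector.collect(doc, 1.0f); // the score is ignored by this collector
        }
    }

    // Returns at most n doc ids, in the order they were encountered (not the best n).
    static List<Integer> collectFirstN(int[] matchingDocs, int n) {
        if (n <= 0) {
            return new ArrayList<>();
        }
        FirstNCollector collector = new FirstNCollector(n);
        try {
            search(matchingDocs, collector);
        } catch (EnoughHitsException expected) {
            // Thrown by the collector on purpose to abort the search loop early.
        }
        return collector.docs();
    }

    public static void main(String[] args) {
        System.out.println(collectFirstN(new int[] {3, 7, 9, 12, 20}, 3)); // prints [3, 7, 9]
    }
}

// Hypothetical stand-in for Lucene 2.4's HitCollector (assumption, see lead-in).
abstract class SimpleHitCollector {
    abstract void collect(int doc, float score);
}

// Unchecked exception used purely as control flow, so stack-trace capture is disabled.
class EnoughHitsException extends RuntimeException {
    EnoughHitsException() {
        super(null, null, false, false);
    }
}

// Records doc ids and aborts collection as soon as the limit is reached.
class FirstNCollector extends SimpleHitCollector {
    private final int limit;
    private final List<Integer> docs = new ArrayList<>();

    FirstNCollector(int limit) {
        this.limit = limit;
    }

    @Override
    void collect(int doc, float score) {
        docs.add(doc);
        if (docs.size() >= limit) {
            throw new EnoughHitsException();
        }
    }

    List<Integer> docs() {
        return docs;
    }
}
```

Note that a collector like this can only cut short the scoring and collection phase. If, as Simon suspects in his replies, most of the time goes into expanding a prefix or wildcard query into its terms before any document is scored, stopping after N hits would not recover that cost.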