RE: Using HitCollector to Collect First N Hits

2009-08-24 Thread Len Takeuchi
Hi Simon,
> that is what my first guess was and I'm pretty sure that the long time
> is taken before the documents get scored. A short prefix can easily
> expand to thousands of terms, do you encounter
> TooManyClausesExceptions and in turn do you set
> BooleanQuery#setMaxClauseCount() to a higher

Re: Using HitCollector to Collect First N Hits

2009-08-22 Thread Len Takeuchi
Hello, I have attached the original thread from where I got my information at the very bottom in case it is of any help. As for whether I want just a boolean retrieval model, in the usage we are currently discussing, the answer is yes (I don't care about the score). However, we also do

Using HitCollector to Collect First N Hits

2009-08-21 Thread Len Takeuchi
Hello, I’m using Lucene 2.4.1 and I’m trying to use a custom HitCollector to collect only the first N hits (not the best hits) for performance. I saw another e-mail in this group where they mentioned writing a HitCollector which throws an exception after N hits to do this. So I tried this approa
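For readers following the thread, the "throw an exception after N hits" idea might look roughly like the sketch below. It is not the poster's code. To keep it compilable without Lucene on the classpath, `HitCollector` here is a local stand-in that mirrors the `collect(int doc, float score)` method of the Lucene 2.4 class, and `fakeSearch`, `FirstNCollector`, `LimitReachedException` and `FirstNDemo` are hypothetical names introduced only for illustration.

```java
import java.util.ArrayList;
import java.util.List;

// Stand-in for org.apache.lucene.search.HitCollector (Lucene 2.4.x), declared
// here only so the sketch compiles on its own. Assumption: with Lucene present
// you would extend the real class instead.
abstract class HitCollector {
    public abstract void collect(int doc, float score);
}

// Hypothetical unchecked exception used purely as a stop signal.
class LimitReachedException extends RuntimeException {
    LimitReachedException() {
        // No message, no cause, no suppression, no stack trace: cheap to throw.
        super(null, null, false, false);
    }
}

// Collects document ids in the order they are reported and aborts the search
// once the requested number of hits has been seen. Scores are ignored.
class FirstNCollector extends HitCollector {
    private final int limit;
    private final List<Integer> docs = new ArrayList<Integer>();

    FirstNCollector(int limit) {
        if (limit <= 0) {
            throw new IllegalArgumentException("limit must be positive");
        }
        this.limit = limit;
    }

    @Override
    public void collect(int doc, float score) {
        docs.add(doc);
        if (docs.size() >= limit) {
            throw new LimitReachedException();
        }
    }

    List<Integer> docs() {
        return docs;
    }
}

class FirstNDemo {
    // Hypothetical stand-in for a searcher: reports each matching doc id to
    // the collector in index order, with a dummy score.
    static void fakeSearch(int[] matchingDocs, HitCollector collector) {
        for (int doc : matchingDocs) {
            collector.collect(doc, 1.0f);
        }
    }

    // Runs the search and treats the stop signal as normal termination.
    static List<Integer> firstN(int[] matchingDocs, int n) {
        FirstNCollector collector = new FirstNCollector(n);
        try {
            fakeSearch(matchingDocs, collector);
        } catch (LimitReachedException expected) {
            // Limit reached: the collector already holds the first n hits.
        }
        return collector.docs();
    }
}
```

With Lucene 2.4 itself, the `fakeSearch` call would presumably be replaced by `searcher.search(query, collector)` inside the same try/catch. Note that this only cuts off the collection phase; if the time is spent before documents are scored, as suggested in the reply above, stopping early in the collector may not help much.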