I'm working on a problem where I need to search over 160 million
documents.  I know Lucene can do this no sweat; my problem is that these
documents are grouped in more than 500 categories.  I need to get a
count of documents that match a given query, within each category.
There is no need to score the documents or even access them;
I just need the count.

Currently I'm using an index per category so I can access the total
number of hits quickly.  I've tried to use a custom HitCollector object
and one large index to achieve the same thing but found that it was 3 to
4 times slower than iterating over 500 individual indexes.
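
To make the single-index counting idea concrete, here is a minimal sketch
in plain Java.  It makes no Lucene calls and is not the collector code
described above.  It assumes a hypothetical precomputed array,
categoryOfDoc, that maps each document number to a category ordinal, and
a BitSet of matching document numbers standing in for whatever the query
produces.  CategoryCounter and countByCategory are illustrative names
only.

```java
import java.util.Arrays;
import java.util.BitSet;

public class CategoryCounter {
    // Assumption: categoryOfDoc[docId] holds the category ordinal
    // (0 .. numCategories-1) of that document, built once up front.
    private final short[] categoryOfDoc;
    private final int numCategories;

    public CategoryCounter(short[] categoryOfDoc, int numCategories) {
        this.categoryOfDoc = categoryOfDoc;
        this.numCategories = numCategories;
    }

    // Walks the set bits (matching docs) and bumps one counter per doc.
    // No scoring and no document loading, only an array lookup.
    public int[] countByCategory(BitSet matches) {
        int[] counts = new int[numCategories];
        for (int doc = matches.nextSetBit(0);
             doc >= 0 && doc < categoryOfDoc.length;
             doc = matches.nextSetBit(doc + 1)) {
            counts[categoryOfDoc[doc]]++;
        }
        return counts;
    }

    public static void main(String[] args) {
        short[] cats = {0, 1, 1, 2, 0};   // five docs, three categories
        CategoryCounter counter = new CategoryCounter(cats, 3);

        BitSet matches = new BitSet();
        matches.set(1);
        matches.set(2);
        matches.set(4);

        // prints [1, 2, 0]
        System.out.println(Arrays.toString(counter.countByCategory(matches)));
    }
}
```

With 160 million documents a short[] table like this would take about
320 MB, so whether it fits in memory is an open question here.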

Searches sometimes take more than 60 seconds to run and can return
counts in the millions.

So my overall question is: can this be done?  Any suggestions would be
helpful.

Thanks

Tim

