I recently played around with a 2 million doc index of docs that averaged
between 2-10k. The system had 4 gig of ram and a 3 gig dual core proc (not
using a parallel searcher to take advantage of the extra core)...pretty
beefy, but with 4 times the docs your talking about. I didn't see a query
that took over a second without a sort.

A similar setup on a single core 3200+ AMD 64 with a gig of ram was also
blazingly fast (no sorts involved again).

- Mark

On 10/12/06, Scott Smith <[EMAIL PROTECTED]> wrote:

Supposed I want to index 500,000 documents (average document size is
4kBs).  Let's assume I create a single index and that the index is
static (I'm not going to add any new documents to it).  I would guess
the index would be around 2GB.



Now, I do searches against this on a somewhat beefy machine (2GB RAM,
Core 2 Duo, Windows XP).  Does anyone have any idea what kinds of search
times I can expect for moderately complicated searches (several sets of
keywords against several fields)?  Are there things I can do to increase
search performance?  For example, does Lucene like lots of RAM, lots of
CPU, faster HD, all of the above?  Am I better splitting the index file
into 2 (N?) versions and search on multiple indexes simultaneously?



Anyone have any thoughts about this?



Scott





Reply via email to