I recently played around with a 2 million doc index of docs that averaged between 2-10k. The system had 4 gig of ram and a 3 gig dual core proc (not using a parallel searcher to take advantage of the extra core)...pretty beefy, but with 4 times the docs your talking about. I didn't see a query that took over a second without a sort.
A similar setup on a single core 3200+ AMD 64 with a gig of ram was also blazingly fast (no sorts involved again). - Mark On 10/12/06, Scott Smith <[EMAIL PROTECTED]> wrote:
Supposed I want to index 500,000 documents (average document size is 4kBs). Let's assume I create a single index and that the index is static (I'm not going to add any new documents to it). I would guess the index would be around 2GB. Now, I do searches against this on a somewhat beefy machine (2GB RAM, Core 2 Duo, Windows XP). Does anyone have any idea what kinds of search times I can expect for moderately complicated searches (several sets of keywords against several fields)? Are there things I can do to increase search performance? For example, does Lucene like lots of RAM, lots of CPU, faster HD, all of the above? Am I better splitting the index file into 2 (N?) versions and search on multiple indexes simultaneously? Anyone have any thoughts about this? Scott