On May 21, 2006, at 11:31 AM, Marcus Falck wrote:
I will use Lucene to index 200 million documents (doc size 2kb -> 20 kb).
With the following requirements:
IndexSearcher needs to be created atleast every 5 minute.
The ranking/scoring/sorting will need to reply the hits ordered by date desc.

Will the sorting be good enough on a machine with 4GB of RAM? Since the cache will be destroyed every time I open a new IndexSercher i need to know how long it will take to rebuild it (the index is stored on a SAN).

My hunch is that sorting will be fine, but you'll want to warm up the caches before putting a new IndexSearcher into production use. Solr does this by running a configurable set of queries against an index before swapping out the old IndexSearcher with a new warmed up one.

Or do I have to change the scoring so I wont need to access the date field?

If i will have to change the scoring does anybody have any tips on where to begin?

Solr's FunctionQuery (isn't it time we move this to the core of Lucene?! :) might be able to help you out in this situation, but it sounds like sorting is what you want to use, not scoring tweaks.

        Erik


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to