5-10 G indexes are pretty small by Lucene/Solr standards, so
given reasonable hardware resources this should be no problem.
That said, only measurement will nail this down. But an
often-used rule of thumb is that you need to consider some
better strategies in the 40G range.

CAUTION: you haven't specified what hardware you're running on,
64/32 bit? Memory available? Other things running on your
machine?

You might want to review:
http://wiki.apache.org/solr/UsingMailingLists

As to your second question, why do you care? If you simply
optimize every so often (say daily or weekly) you'll reclaim all
the space anyway and avoid operational/programmatic
complexity. I'd venture that you won't notice any performance
difference unless you're changing your documents at a
furious rate.

Best
Erick

On Thu, Mar 31, 2011 at 8:14 AM, sol myr <solmy...@gmail.com> wrote:
> Hi,
>
> I'm rephrasing a previous performance question, in light of new data...
> I have a Lucene index of about 0.5 GB.
> Currently performance is good - up to 200 milliseconds per search (with
> complex boolean queries, but never retrieving more than 200 top results).
>
> The question: how much can the index grow, before there's noticeable
> performance degradation?
>
> 1) Does anyone please have production experience with, say, 5 GB index? 10
> GB?
> If so, are there recommendations about merge policy, file size
> configuration, etc?
> If it degrades, I have other solutions (involving a change in logic), but I
> don't want to get into it unless necessary.
>
> 2) Also, about 5% of my documents are editable (= the application
> occasionally deletes them, and adds a modified document instead).
> The other 90% are "immutable" (never deleted/edited).
> Can Lucene take advantage of this? E.g. will it be smart enough to keep
> changes in a single small file (which needs to be optimized), while the
> other files remain unchanged?
>
> Thanks :)
>

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Reply via email to