Re: search performance

Jamie Fri, 20 Jun 2014 00:50:16 -0700

Hi All

Thank you for all your suggestions. Some of the recommendations hadn'tyet been implemented, as our code base was using older versions ofLucene with reduced capabilities. Thus, far, all the recommendationsfor fast search have been implemented (e.g. using pagination withsearchAfter, DirectoryReader.openIfChanged, avoiding wrapping lucenescoreDoc results, option to disable sorting, etc.).

While, in some environments, search performance has improvedsignificantly, in other larger ones we are unfortunately, still seeing 1minute - 5 minute search times. For instance, in one site, the totalindex size is 500GB with 190 million documents indexed. They are runninga machine with 24 core and 4 SSD drives to house the indexes. New emailsare being added to the indexes at a rate of 10 message/sec.

One area possible area for improvement: Searching is being conductedacross several indexes. To accomplish this, on each search, aMultiReader is constructed, that consists of several subreaders createdby the DirectoryReader.openIfChangedMethod. Only one of the indexes isupdated frequently, the others are never updated. For each search, anew IndexSearcher is created passed the MultiReader in the constructor.From what I've read, MultiReader and IndexSearcher are relativelylightweight and should not impact search performance. Is this correct?Is there a faster way to handle searching across multiple indexes? Whatis the performance impact of searching across multiple indexes?

Am I correct that using SearchManager can't be used with a MultiReaderand NRT? I would appreciate all suggestions on how to optimize oursearch performance further. Search time has become a usability issue.


Much appreciate

Jamie

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: search performance

Reply via email to