Pretty sure that's never been made configurable. I've seen anecdotal evidence of a 30-40% slowdown when adding the first replica, from there the penalty is much less.
Cp Mishra: Any time you change code you're absolutely invited to open a JIRA and attach the code for people to look at. Please feel free to! Best, Erick On Fri, Oct 17, 2014 at 12:31 PM, Shawn Heisey <[email protected]> wrote: > On 10/17/2014 8:50 AM, Cp Mishra wrote: >> >> So, we changed the logic to: >> >> -Read SolrInputDocument objects from stream in batches of 500. >> >> -Add documents to ConcurrentUpdateSolrServer instance >> >> -Index documents in a loop >> >> This has improved indexing speed significantly. >> >> What are the caveats to this approach? >> > > Thinking back to when we were having deadlock problems with heavy indexing > on SolrCloud, I seem to recall that one of the experts said that SolrCloud > already does batch the documents, only it was 10 at a time. I also seemed > to remember that making the batch size configurable was discussed, but I > don't know how discussion ended. Am I remembering incorrectly? > > I'm not familiar with the actual code for this part of Solr at all. > > Thanks, > Shawn > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [email protected] > For additional commands, e-mail: [email protected] > --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
