Re: Batch replica transfer from master shard solr cloud

Erick Erickson Fri, 17 Oct 2014 11:06:28 -0700

Pretty sure that's never been made configurable.

I've seen anecdotal evidence of a 30-40% slowdown when adding the
first replica, from there the penalty is much less.


Cp Mishra:
Any time you change code you're absolutely invited to open a JIRA and
attach the code for people to look at. Please feel free to!

Best,
Erick

On Fri, Oct 17, 2014 at 12:31 PM, Shawn Heisey <[email protected]> wrote:
> On 10/17/2014 8:50 AM, Cp Mishra wrote:
>>
>> So, we changed the logic to:
>>
>> -Read SolrInputDocument objects from stream in batches of 500.
>>
>> -Add  documents to ConcurrentUpdateSolrServer instance
>>
>> -Index documents in a loop
>>
>> This has improved indexing speed significantly.
>>
>> What are the caveats to this approach?
>>
>
> Thinking back to when we were having deadlock problems with heavy indexing
> on SolrCloud, I seem to recall that one of the experts said that SolrCloud
> already does batch the documents, only it was 10 at a time.  I also seemed
> to remember that making the batch size configurable was discussed, but I
> don't know how discussion ended.  Am I remembering incorrectly?
>
> I'm not familiar with the actual code for this part of Solr at all.
>
> Thanks,
> Shawn
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [email protected]
> For additional commands, e-mail: [email protected]
>

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: Batch replica transfer from master shard solr cloud

Reply via email to