Re: Commit strategy for Heavy Bulk Indexing into solr

2021-07-28 Thread Pratik Patel
Thanks Endika! https://issues.apache.org/jira/browse/SOLR-14923 @DavidSmiley do you think this could be related to the issue I have described? I will certainly update our solr image but it will be good to know the root cause of the issue. Your comment on this would be very helpful. Thanks On

Re: Commit strategy for Heavy Bulk Indexing into solr

2021-07-28 Thread Endika Posadas
There were some big changes related to child indexing in solr 8.8, under this ticket: https://issues.apache.org/jira/browse/SOLR-14923 It's worth updating solr to latest 8.8 and trying again, perhaps your indexing issue has already been fixed. On 2021/07/27 19:44:13, Pratik Patel wrote: > So

Re: Commit strategy for Heavy Bulk Indexing into solr

2021-07-27 Thread Pratik Patel
So it looks like I have narrowed down where the problem is and have also found a workaround but I would like to understand more. As I had mentioned, we have two stages in our bulk indexing operation. stage 1 : index Article documents [A1, A2.An] stage 2 : index Article documents with children

Re: Commit strategy for Heavy Bulk Indexing into solr

2021-07-23 Thread Pratik Patel
Interesting! I will certainly test this. What interval would you suggest for the soft commits? Also, is there a way to disable real-time get so that we can disable soft commits? Triggering a soft commit would open new searcher and recreate caches, we would like to avoid it if possible as there's n

Re: Commit strategy for Heavy Bulk Indexing into solr

2021-07-23 Thread Joel Bernstein
Whether you use real-time-get or not you still need to soft commit to release the memory used to support real-time-get. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Jul 23, 2021 at 3:39 PM Pratik Patel wrote: > Thanks for the response Joel. > > We do not use "Real-time-get" queries. A

Re: Commit strategy for Heavy Bulk Indexing into solr

2021-07-23 Thread Pratik Patel
Thanks for the response Joel. We do not use "Real-time-get" queries. Also, we don't query the index while a particular stage of bulk indexing is going on. Would it still help to enable soft commits? On Fri, Jul 23, 2021 at 3:16 PM Joel Bernstein wrote: > First thing to try is turning on softcom

Re: Commit strategy for Heavy Bulk Indexing into solr

2021-07-23 Thread Joel Bernstein
First thing to try is turning on softcommits. You need to open new searchers while indexing to free up the memory used to support real-time-get queries. Real-time-get supports queries on uncommitted data, so to support this a memory component is needed for records that are indexed, but not yet visi

Re: Commit strategy for Heavy Bulk Indexing into solr

2021-07-23 Thread Pratik Patel
Solr Cloud version is 8.5. I have also attached the solr log with gc enabled and our app log which shows that there was SocketTimeoutException. On Fri, Jul 23, 2021 at 2:31 PM Pratik Patel wrote: > Hi All, > > *tl;dr* : running into long GC pauses and solr client socket timeouts > when indexing