Re: Indexing Approach

2018-06-28 Thread Shawn Heisey
On 6/27/2018 8:59 PM, solrnoobie wrote: > One last thing though, will a queue based system work for us? Or are there > better suitable implementations? Exactly how you write your indexing software is up to you.  There is no single approach that's the best.  Examine your business needs and the thin

Re: Indexing Approach

2018-06-27 Thread solrnoobie
Thank you very much! This helped a lot in our estimates. One last thing though, will a queue based system work for us? Or are there better suitable implementations? -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: Indexing Approach

2018-06-26 Thread Shawn Heisey
On 6/26/2018 8:24 AM, solrnoobie wrote: > - Each SP call will return 15 result sets. > - Each document can contain 300-1000 child documents. > - If the batch size is 1000, the child documents for each can contain > 300-1000 documents so that will eat up the 4g's allocated to the > application. If

Re: Indexing Approach

2018-06-26 Thread solrnoobie
Thanks for the tip. Although we have increased our application's heap to 4g and it is still not enough. I guess here are the things we think we did wrong: - Each SP call will return 15 result sets. - Each document can contain 300-1000 child documents. - If the batch size is 1000, the child docum

Re: Indexing Approach

2018-06-26 Thread Shawn Heisey
On 6/26/2018 12:06 AM, solrnoobie wrote: We are having errors such as heap space error in our indexing so we decided to lower the batch size to 50. The problem with this is that sometimes it really does not help since 1 document can contain 1000 child documents and it will still have the heap err

Re: Indexing Approach

2018-06-26 Thread solrnoobie
1. We have 5 nodes and 3 zookeepers (will autoscale if needed) 2. We use java with the help of solrj / spring data for indexing. 3. We see the exception in our application so this is probably our fault and not solr's so I'm asking what is the best approach for documents with a lot of child docume

Re: Indexing Approach

2018-06-25 Thread Aroop Ganguly
Would you mind sharing details on 1. the Solr Cloud setup, how may nodes do you have at your disposal and how many shards do you have setup ? 2. The indexing technology, what are you using? Core java/.net threads ? Or a system like spark ? 3. Where do you see the exceptions? The indexer process l