Re: Solr Cloud - Query with results around 2 million records time out.

Ere Maijala Tue, 05 Apr 2022 05:33:44 -0700

Hi,

Doing what you describe won't work with SolrCloud. The reason for thisis that Solr will have to merge a large number of records from the nodesto get the results you ask, and this is a really resource-intensivetask. Deep paging for a limited number of results isn't much bettersince Solr will still have to do the huge merge. cursorMark, like Thomassuggested, is the only way to get all these results. See thedocumentation for more information:

https://solr.apache.org/guide/8_8/pagination-of-results.html#performance-problems-with-deep-paging


Best,
Ere

Puttaganti, Venkat kirjoitti 5.4.2022 klo 14.40:

Hi Team,
      I hope you are doing good. We have come across an issue/limitation with 
Solr cloud, when users are trying to query with around 2 million data as part 
of the response.
      The same query works fine and return the results with standalone Solr. 
Can you suggested if it requires any additional configurations.

We are using Solr 8.8 and Zookeeper 3.6.2.
Heap set to 24 GB
Could has three nodes. Solr and Zookeeper in the same EC2 box.

Thanks in advance.

Regards,
Venkat.


Information Classification: General


--
Ere Maijala
Kansalliskirjasto / The National Library of Finland

Re: Solr Cloud - Query with results around 2 million records time out.

Reply via email to