Hi,

Doing what you describe won't work with SolrCloud. The reason for this is that Solr will have to merge a large number of records from the nodes to get the results you ask, and this is a really resource-intensive task. Deep paging for a limited number of results isn't much better since Solr will still have to do the huge merge. cursorMark, like Thomas suggested, is the only way to get all these results. See the documentation for more information:

https://solr.apache.org/guide/8_8/pagination-of-results.html#performance-problems-with-deep-paging

Best,
Ere

Puttaganti, Venkat kirjoitti 5.4.2022 klo 14.40:
Hi Team,
      I hope you are doing good. We have come across an issue/limitation with 
Solr cloud, when users are trying to query with around 2 million data as part 
of the response.
      The same query works fine and return the results with standalone Solr. 
Can you suggested if it requires any additional configurations.

We are using Solr 8.8 and Zookeeper 3.6.2.
Heap set to 24 GB
Could has three nodes. Solr and Zookeeper in the same EC2 box.

Thanks in advance.

Regards,
Venkat.


Information Classification: General


--
Ere Maijala
Kansalliskirjasto / The National Library of Finland

Reply via email to