Hi,
Doing what you describe won't work with SolrCloud. The reason is that
Solr has to merge a large number of records from the nodes to produce
the results you ask for, and this is a really resource-intensive task.
Deep paging with a limited number of results per request isn't much
better, since Solr still has to do the huge merge. cursorMark, as Thomas
suggested, is the only way to get all these results. See the
documentation for more information:
https://solr.apache.org/guide/8_8/pagination-of-results.html#performance-problems-with-deep-paging
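For illustration, here is a minimal Python sketch of a cursorMark loop.
It is not taken from your setup: the helper names (solr_select,
iter_all_docs), the page size, and the assumption that the uniqueKey
field is called "id" are all mine, so adjust them to your schema.
cursorMark needs a sort that includes the uniqueKey field, the first
request uses cursorMark=*, and you stop when nextCursorMark comes back
unchanged.

```python
import json
import urllib.parse
import urllib.request


def solr_select(base_url, params):
    """Send one /select request and return the parsed JSON response."""
    url = base_url + "/select?" + urllib.parse.urlencode(params)
    with urllib.request.urlopen(url) as resp:
        return json.load(resp)


def iter_all_docs(base_url, query, rows=1000, fetch=solr_select):
    """Yield every matching document, one page at a time, via cursorMark."""
    cursor = "*"  # "*" asks Solr to start a new cursor
    while True:
        params = {
            "q": query,
            "rows": rows,
            # cursorMark requires a sort that includes the uniqueKey field;
            # "id" is assumed to be the uniqueKey here.
            "sort": "id asc",
            "cursorMark": cursor,
            "wt": "json",
        }
        data = fetch(base_url, params)
        yield from data["response"]["docs"]
        next_cursor = data["nextCursorMark"]
        if next_cursor == cursor:  # unchanged cursor means no more results
            break
        cursor = next_cursor
```

Usage would be something like
iter_all_docs("http://localhost:8983/solr/mycollection", "*:*"), where
the URL and collection name are placeholders. Each request then only
fetches one page of rows instead of forcing a merge of millions of
records at once.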
Best,
Ere
Puttaganti, Venkat wrote on 5.4.2022 at 14.40:
Hi Team,
I hope you are doing well. We have come across an issue/limitation with
SolrCloud when users try to run a query that returns around 2 million
records as part of the response.
The same query works fine and returns the results with standalone Solr.
Can you suggest whether it requires any additional configuration?
We are using Solr 8.8 and ZooKeeper 3.6.2.
Heap is set to 24 GB.
The cloud has three nodes, with Solr and ZooKeeper on the same EC2 box.
Thanks in advance.
Regards,
Venkat.
--
Ere Maijala
Kansalliskirjasto / The National Library of Finland