On 10/11/22 12:30, Natarajan, Rajeswari wrote:
> We have a six-node SolrCloud cluster with about 780 collections, each
> having one shard and 3 replicas. Collection create/delete requests now
> time out, and when we try the async option, the job gets submitted and
> remains in that state for hours. We tried restarting the Solr nodes;
> nothing changed. CPU usage (~0%) and heap (<70%) are good.
> In the Solr admin UI, the Cloud -> Tree section takes a long time to
> load; it looks like it takes time to connect to ZK. We are thinking of
> deleting collections manually on disk and in ZK. Is there any other
> solution to get around this issue? We don’t see any errors in the logs.

That many collections will lead to problems. SolrCloud has a
scalability problem when the number of collections gets beyond a few
hundred. I did some investigation into this a while back.

https://issues.apache.org/jira/browse/SOLR-7191

This issue was marked as resolved, though no code was committed in
connection with the issue. Later tests that I did suggest that the
problem has gotten worse, not better, since version 6.x. It wasn't a
rigorous re-test, so I have no hard numbers.
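For reference, here is a minimal sketch of how the async delete mentioned in the question, and the follow-up status check, are addressed through the Collections API. It only builds the request URLs and sends nothing. The base URL, the collection name, and the request id are placeholders I made up for illustration. DELETE, REQUESTSTATUS, and OVERSEERSTATUS are Collections API actions; the helper function names are my own.

```python
from urllib.parse import urlencode

# Assumption: placeholder base URL; point this at one of your own Solr nodes.
SOLR_BASE = "http://localhost:8983/solr"


def collections_api_url(action, **params):
    """Build a Collections API URL for the given action and parameters."""
    query = urlencode({"action": action, **params})
    return f"{SOLR_BASE}/admin/collections?{query}"


def async_delete_url(collection, request_id):
    """URL that submits a collection delete as an async request."""
    # "async" is a Python keyword, so it is passed through a dict.
    return collections_api_url("DELETE", name=collection, **{"async": request_id})


def request_status_url(request_id):
    """URL that reports the state of a previously submitted async request."""
    return collections_api_url("REQUESTSTATUS", requestid=request_id)


def overseer_status_url():
    """URL that reports overseer state, including its work queue statistics."""
    return collections_api_url("OVERSEERSTATUS")


if __name__ == "__main__":
    # Hypothetical collection name and request id, for illustration only.
    print(async_delete_url("test_coll", "del-1"))
    print(request_status_url("del-1"))
    print(overseer_status_url())
```

If REQUESTSTATUS keeps reporting the request as submitted and never running, looking at the OVERSEERSTATUS output may help show whether the overseer is working through its queue at all.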
Thanks,
Shawn