Hi,

I am currently working on implementing sharding on our existing SolrCloud
cluster.
The main idea is to be able to scale horizontally.

At the moment, without sharding, all collections sit on all
servers.
We also have pretty heavy streaming expressions returning many ids,
on average 300,000 ids to join.

After sharding, I see a huge increase in CPU and memory usage,
which makes queries much slower than without sharding.

I guess that's expected because the joins need to send data across servers
over the network.

Any thoughts on best practices here? I guess one possible approach is to
split the collections into more shards.

Regards
Sergio
