Three nodes with nginx in front will handle well over 50k searches a day on a 
half terabyte index, but only one node is to serve the searches the rest are 
backups. I would never put solr in a container 

> On Jul 17, 2022, at 10:44 AM, Shawn Heisey <apa...@elyograg.org> wrote:
> 
> On 7/17/22 07:40, Ronen Nussbaum wrote:
>> We are planning to migrate our Solr Cloud clusters to the cloud.
>> Currently it is installed on-prem for each customer.
>> It is already deployed as Docker containers.
>> Instead of estimating in advance what is the number of shards needed, or
>> the number of pods, we'd like to rely on EKS cluster autoscaler, K8S HPA
>> and Solr autoscaling.
>> Our main concern is the deprecation of the autoscaling feature since
>> version 9.0.
> 
> One thing I am not sure you're aware of:  You can't add shards to a 
> collection unless it is using the implicit router, which is poorly named 
> because what it means is that sharding is 100% user-managed (manual).
> 
> There is the shard splitting capability in the Collections API, but that only 
> works on a single shard, not the whole collection. If you wanted to adjust 
> from say 6 to 8 shards and still have the shards be approximately equal in 
> size, you would need to build a new collection and completely reindex.
> 
> There have been a number of issues filed for a rebalance feature, but it has 
> not been implemented because implementing it would involve a very large 
> amount of work, and making it stable would take even more work.  And I am not 
> even sure the Lucene API has the capability to do it currently.
> 
> https://issues.apache.org/jira/browse/SOLR-9241
> 
>> What is your recommendation? Should we start with 8.11? Will it be a
>> substitute soon?
> 
> I have no idea whether a substitute will be available.  You can manually do 
> everything that the autoscaler would do with the Collections API.
> 
> Thanks,
> Shawn
> 

Reply via email to