On 1/17/23 10:05, Matthew Castrigno wrote:
> What is the best approach for high availability for my deployed SOLR
> instance? Solr clusters seem to be for scaling, which is not my concern
> as my use case is a simple site search.
> This SOLR instance is expected to serve about 150K requests per month.
> The documents are relatively small with about 50 fields indexed each.

If you go with SolrCloud, you need three servers minimum. ZooKeeper
requires 3 or more nodes for high availability, because it must keep a
majority of its ensemble running. Solr itself only requires two. So the
most minimal cloud setup is 3 servers all running ZK, with two of them
also running Solr.
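
To make the three-server minimum concrete, here is a small Python sketch of the majority-quorum arithmetic that ZooKeeper relies on. The function names are made up for illustration and are not part of any ZooKeeper or Solr API.

```python
def zk_quorum(ensemble_size: int) -> int:
    """Smallest strict majority of an ensemble of the given size."""
    return ensemble_size // 2 + 1


def zk_tolerated_failures(ensemble_size: int) -> int:
    """How many ZK nodes can be lost while a majority is still running."""
    return ensemble_size - zk_quorum(ensemble_size)


if __name__ == "__main__":
    for n in (1, 2, 3, 4, 5):
        print(f"{n} ZK node(s): quorum={zk_quorum(n)}, "
              f"can lose {zk_tolerated_failures(n)}")
```

With one or two ZK nodes, no failure can be tolerated, and three is the smallest ensemble that survives the loss of one node. A fourth node adds nothing over three, which is why odd ensemble sizes are the usual choice.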
All servers must be on separate physical hardware. If you create three
VMs all on the same host, then if that host dies, your Solr is
completely down.
Nginx was mentioned as a load balancer. I use haproxy, but nginx would
work well too. It is a good idea when setting up load balancing to have
a virtual IP address. Two programs for setting that up are ucarp and
pacemaker. I'm sure there are more options.
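
For a rough idea of what the load balancer's health check does, here is a minimal Python sketch that picks the first Solr node that answers. It is not how haproxy or nginx are configured, only an illustration of the idea. The host names are hypothetical, and the sketch assumes each node exposes Solr's system info endpoint at `/solr/admin/info/system` without authentication.

```python
from typing import Callable, Iterable, Optional
from urllib.error import URLError
from urllib.request import urlopen


def solr_is_up(base_url: str, timeout: float = 2.0) -> bool:
    """Return True if the node answers its system info endpoint with HTTP 200."""
    try:
        # Assumption: the endpoint is reachable without authentication.
        with urlopen(f"{base_url}/solr/admin/info/system", timeout=timeout) as resp:
            return resp.status == 200
    except (URLError, OSError, ValueError):
        # Connection refused, timeout, HTTP error, or a malformed URL.
        return False


def first_healthy(
    backends: Iterable[str],
    probe: Callable[[str], bool] = solr_is_up,
) -> Optional[str]:
    """Return the first backend that passes the probe, or None if all fail."""
    for url in backends:
        if probe(url):
            return url
    return None


if __name__ == "__main__":
    # Hypothetical host names, one per physical machine.
    nodes = ["http://solr1.example.com:8983", "http://solr2.example.com:8983"]
    print(first_healthy(nodes))
```

A real load balancer runs this kind of check continuously and in the background. The virtual IP then covers the remaining single point of failure, which is the load balancer itself.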
Thanks,
Shawn