Thanks Anshum, Dima!  Yes, I figure this approach will be quite challenging
to implement, and may not be worth the cost.

Anshum,

I had not thought of versioning (
https://solr.apache.org/guide/8_2/updating-parts-of-documents.html#document-centric-versioning-constraints),
but will consider it.  Yes, some of our updates are Atomic updates.

Yes, initial thinking is using a single "queue" of updates where multiple
instances (associated to a particular SorlCloud instance) of the same
indexing service will consume from the queue and index to their associated
SolrCloud instance.

I will take a look at your proposal!

Thanks again,

Matt

On Tue, Mar 1, 2022 at 4:12 PM Anshum Gupta <ans...@anshumgupta.net> wrote:

> Hi Matt,
>
> I'll start by saying that this has been long due at my end.
>
> There are a multitude of challenges with a hot-hot architecture involving
> multiple SolrCloud clusters. An important question here is if you are going
> to manage the versioning yourself. Also, if your updates would ever
> overwrite data. Here's an initial proposal for something along those lines
> (but doesn't support an unversioned hot-hot setup w/ document edits) -
>
> https://cwiki.apache.org/confluence/display/SOLR/SIP-13%3A+Cross+Data+Center+Replication
>
> Hot-Hot setups are really complex and there are a few ways I've handled (or
> seen them being handled.
> 1. The best way here is to either have externally versioned documents sent
> to Solr clusters or
> 2. rely on a single point of entry i.e. updates always go to a queuing
> service for instance and then have an application that's responsible for
> consuming from this (queue?).
>
> -Anshum
>
> On Tue, Mar 1, 2022 at 3:02 PM mtn search <search...@gmail.com> wrote:
>
> > Hello,
> >
> > My team is looking to deploy Solr 8 SolrCloud on two on-prem datacenters
> > via EKS.  We are considering a HOT | HOT HA architecture between the data
> > centers where data would be indexed (duplicated) to SolrCloud instances
> in
> > both datacenters. Then via service (to be worked out) queries could go to
> > either datacenter.
> >
> > I believe one of the challenges will be keeping the SolrCloud instances
> > (holding the same data) in sync.
> >
> > I am curious if others have tried this and are willing to share any tips,
> > lessons learned, or things we should consider.
> >
> > Thanks,
> > Matt
> >
>
>
> --
> Anshum Gupta
>

Reply via email to