Re: Kafka connect replication using MirrorMaker 2.0

Ning Zhang Mon, 22 Mar 2021 14:42:12 -0700

Hi Daniel, it is probably hard to figure out how to sync those topics across 
two Kafka Connect clusters.


In general, if a solution has a strong technical prerequisite (e.g. the Kafka 
Connect clusters being aware of each other's offsets), it may be better to go 
with a simpler solution first, for example an "active-active" MM2 setup on one 
Kafka Connect cluster.
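
A rough sketch of what the two MM2 connector configs could look like, written 
as a small Python helper that builds the JSON bodies for the Kafka Connect REST 
API (POST /connectors). The cluster aliases ("dc1", "dc2"), the bootstrap 
addresses and the connector names are made up for illustration, and which 
Connect cluster each config gets submitted to depends on your deployment, so 
treat this as a starting point rather than a tested setup.

```python
import json

MIRROR_SOURCE_CLASS = "org.apache.kafka.connect.mirror.MirrorSourceConnector"
BYTE_ARRAY_CONVERTER = "org.apache.kafka.connect.converters.ByteArrayConverter"


def mirror_source_config(source_alias, source_bootstrap,
                         target_alias, target_bootstrap, topics=".*"):
    """Build a Connect REST payload for one one-way MM2 replication flow."""
    return {
        # Hypothetical naming scheme, chosen only for readability.
        "name": f"mm2-{source_alias}-to-{target_alias}",
        "config": {
            "connector.class": MIRROR_SOURCE_CLASS,
            "source.cluster.alias": source_alias,
            "target.cluster.alias": target_alias,
            "source.cluster.bootstrap.servers": source_bootstrap,
            "target.cluster.bootstrap.servers": target_bootstrap,
            # Regex of topics to replicate; ".*" is only a placeholder.
            "topics": topics,
            # MM2 copies raw bytes, so no (de)serialization is applied.
            "key.converter": BYTE_ARRAY_CONVERTER,
            "value.converter": BYTE_ARRAY_CONVERTER,
        },
    }


def active_active_configs(alias_a, bootstrap_a, alias_b, bootstrap_b, topics=".*"):
    """Return two one-way flows (A->B and B->A) that together act as active-active."""
    return [
        mirror_source_config(alias_a, bootstrap_a, alias_b, bootstrap_b, topics),
        mirror_source_config(alias_b, bootstrap_b, alias_a, bootstrap_a, topics),
    ]


if __name__ == "__main__":
    # Assumed example values; replace with your own clusters.
    for payload in active_active_configs("dc1", "dc1-kafka:9092",
                                         "dc2", "dc2-kafka:9092"):
        print(json.dumps(payload, indent=2))
```

Each printed payload describes one direction of replication; the pair gives 
the "active-active" behaviour. Note that this only mirrors regular topics and 
does nothing about "connect-offsets".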

On 2021/03/19 14:23:30, dandanie...@gmail.com wrote: 
> Hello and thank you for the reply! 
> 
> My problem is not with consumption of messages, because as you said, 
> MirrorMaker2 knows how to deal with the consumer offsets. Rather my problem 
> is with source connectors and the topic connect-offsets. 
> 
> Because Kafka Connect tracks where it stopped reading from a source using 
> that topic, the second Connect cluster is not aware of the offsets and I can 
> get duplicates. I know MirrorMaker2 does not guarantee exactly-once delivery, 
> but is there a way to sync the topics so that the Kafka Connect clusters 
> will be aware of each other's offsets? 
> 
> Daniel 
> 
> > On 19 Mar 2021, at 9:09, Ning Zhang <ning2008w...@gmail.com> wrote:
> > Hi Daniel, MirrorMaker2 creates its own "offsets" topic to track the 
> > progress of consumption.
> > 
> > just my 2 cents - If you already have two Kafka connect clusters in two 
> > different sites, it sounds practical to:
> > (1) use "cluster" mode, instead of "dedicated" mode of MirrorMaker2
> > (2) add one "MirrorMaker" connector on each Kafka connect cluster and do 
> > "one-way" replication. As you have two "MirrorMaker" connectors, it should 
> > behave like "active-active" deployment.
> > 
> > In case you only have one Kafka Connect cluster, do "active-active" 
> > replication rather than "one-way".
> > 
> > On 2021/03/18 08:15:40, Daniel Beilin <dandanie...@gmail.com> wrote: 
> >> Hi everyone,
> >> I'm trying to create an active-active deployment of a kafka cluster between
> >> two data centers using MirrorMaker2, but I'm facing a problem.
> >> In my deployment I have Kafka Connect in both sites, each of which
> >> connects to a different database using sink and source connectors (MongoDB
> >> source connector, JDBC sink/source connector).
> >> I’d like to know what the best practice for active-active is when using
> >> Kafka Connect, since I noticed the “connect-offsets” topic is not
> >> replicated by MM2.
> >> 
> >> Best regards,
> >> Daniel
>
