Harsha, yes I can do that. I'll update the KIP accordingly, thanks. Ryanne
On Wed, Oct 17, 2018 at 12:18 PM Harsha <ka...@harsha.io> wrote: > Hi Ryanne, > Thanks for the KIP. I am also curious about why not use the > uReplicator design as the foundation given it alreadys resolves some of the > fundamental issues in current MIrrorMaker, updating the confifgs on the fly > and running the mirror maker agents in a worker model which can deployed in > mesos or container orchestrations. If possible can you document in the > rejected alternatives what are missing parts that made you to consider a > new design from ground up. > > Thanks, > Harsha > > On Wed, Oct 17, 2018, at 8:34 AM, Ryanne Dolan wrote: > > Jan, these are two separate issues. > > > > 1) consumer coordination should not, ideally, involve unreliable or slow > > connections. Naively, a KafkaSourceConnector would coordinate via the > > source cluster. We can do better than this, but I'm deferring this > > optimization for now. > > > > 2) exactly-once between two clusters is mind-bending. But keep in mind > that > > transactions are managed by the producer, not the consumer. In fact, it's > > the producer that requests that offsets be committed for the current > > transaction. Obviously, these offsets are committed in whatever cluster > the > > producer is sending to. > > > > These two issues are closely related. They are both resolved by not > > coordinating or committing via the source cluster. And in fact, this is > the > > general model of SourceConnectors anyway, since most SourceConnectors > > _only_ have a destination cluster. > > > > If there is a lot of interest here, I can expound further on this aspect > of > > MM2, but again I think this is premature until this first KIP is > approved. > > I intend to address each of these in separate KIPs following this one. > > > > Ryanne > > > > On Wed, Oct 17, 2018 at 7:09 AM Jan Filipiak <jan.filip...@trivago.com> > > wrote: > > > > > This is not a performance optimisation. Its a fundamental design > choice. > > > > > > > > > I never really took a look how streams does exactly once. (its a trap > > > anyways and you usually can deal with at least once donwstream pretty > > > easy). But I am very certain its not gonna get somewhere if offset > > > commit and record produce cluster are not the same. > > > > > > Pretty sure without this _design choice_ you can skip on that exactly > > > once already > > > > > > Best Jan > > > > > > On 16.10.2018 18:16, Ryanne Dolan wrote: > > > > > But one big obstacle in this was > > > > always that group coordination happened on the source cluster. > > > > > > > > Jan, thank you for bringing up this issue with legacy MirrorMaker. I > > > > totally agree with you. This is one of several problems with > MirrorMaker > > > > I intend to solve in MM2, and I already have a design and prototype > that > > > > solves this and related issues. But as you pointed out, this KIP is > > > > already rather complex, and I want to focus on the core feature set > > > > rather than performance optimizations for now. If we can agree on > what > > > > MM2 looks like, it will be very easy to agree to improve its > performance > > > > and reliability. > > > > > > > > That said, I look forward to your support on a subsequent KIP that > > > > addresses consumer coordination and rebalance issues. Stay tuned! > > > > > > > > Ryanne > > > > > > > > On Tue, Oct 16, 2018 at 6:58 AM Jan Filipiak < > jan.filip...@trivago.com > > > > <mailto:jan.filip...@trivago.com>> wrote: > > > > > > > > Hi, > > > > > > > > Currently MirrorMaker is usually run collocated with the target > > > > cluster. > > > > This is all nice and good. But one big obstacle in this was > > > > always that group coordination happened on the source cluster. So > > > when > > > > then network was congested, you sometimes loose group membership > and > > > > have to rebalance and all this. > > > > > > > > So one big request from we would be the support of having > > > coordination > > > > cluster != source cluster. > > > > > > > > I would generally say a LAN is better than a WAN for doing group > > > > coordinaton and there is no reason we couldn't have a group > consuming > > > > topics from a different cluster and committing offsets to another > > > > one right? > > > > > > > > Other than that. It feels like the KIP has too much features > where > > > many > > > > of them are not really wanted and counter productive but I will > just > > > > wait and see how the discussion goes. > > > > > > > > Best Jan > > > > > > > > > > > > On 15.10.2018 18:16, Ryanne Dolan wrote: > > > > > Hey y'all! > > > > > > > > > > Please take a look at KIP-382: > > > > > > > > > > > > > > > > > > https://cwiki.apache.org/confluence/display/KAFKA/KIP-382%3A+MirrorMaker+2.0 > > > > > > > > > > Thanks for your feedback and support. > > > > > > > > > > Ryanne > > > > > > > > > > > > >