Ryanne, Makes sense. Can you please add this under rejected alternatives so that everyone has context on why it wasn’t picked.
Thanks, Harsha On Oct 18, 2018, 8:02 AM -0700, Ryanne Dolan <ryannedo...@gmail.com>, wrote: > Harsha, concerning uReplicator specifically, the project is a major > inspiration for MM2, but I don't think it is a good foundation for anything > included in Apache Kafka. uReplicator uses Helix to solve problems that > Connect also solves, e.g. REST API, live configuration changes, cluster > management, coordination etc. This also means that existing tooling, > dashboards etc that work with Connectors do not work with uReplicator, and > any future tooling would need to treat uReplicator as a special case. > > Ryanne > > > On Wed, Oct 17, 2018 at 12:30 PM Ryanne Dolan <ryannedo...@gmail.com> wrote: > > > Harsha, yes I can do that. I'll update the KIP accordingly, thanks. > > > > > > Ryanne > > > > > > > On Wed, Oct 17, 2018 at 12:18 PM Harsha <ka...@harsha.io> wrote: > > > > > Hi Ryanne, > > > > > Thanks for the KIP. I am also curious about why not > > > > > use the uReplicator design as the foundation given it alreadys > > > > > resolves some of the fundamental issues in current MIrrorMaker, > > > > > updating the confifgs on the fly and running the mirror maker agents > > > > > in a worker model which can deployed in mesos or container > > > > > orchestrations. If possible can you document in the rejected > > > > > alternatives what are missing parts that made you to consider a new > > > > > design from ground up. > > > > > > > > > > Thanks, > > > > > Harsha > > > > > > > > > > On Wed, Oct 17, 2018, at 8:34 AM, Ryanne Dolan wrote: > > > > > > Jan, these are two separate issues. > > > > > > > > > > > > 1) consumer coordination should not, ideally, involve unreliable or > > > > > > slow > > > > > > connections. Naively, a KafkaSourceConnector would coordinate via > > > > > > the > > > > > > source cluster. We can do better than this, but I'm deferring this > > > > > > optimization for now. > > > > > > > > > > > > 2) exactly-once between two clusters is mind-bending. But keep in > > > > > > mind that > > > > > > transactions are managed by the producer, not the consumer. In > > > > > > fact, it's > > > > > > the producer that requests that offsets be committed for the current > > > > > > transaction. Obviously, these offsets are committed in whatever > > > > > > cluster the > > > > > > producer is sending to. > > > > > > > > > > > > These two issues are closely related. They are both resolved by not > > > > > > coordinating or committing via the source cluster. And in fact, > > > > > > this is the > > > > > > general model of SourceConnectors anyway, since most > > > > > > SourceConnectors > > > > > > _only_ have a destination cluster. > > > > > > > > > > > > If there is a lot of interest here, I can expound further on this > > > > > > aspect of > > > > > > MM2, but again I think this is premature until this first KIP is > > > > > > approved. > > > > > > I intend to address each of these in separate KIPs following this > > > > > > one. > > > > > > > > > > > > Ryanne > > > > > > > > > > > > On Wed, Oct 17, 2018 at 7:09 AM Jan Filipiak > > > > > > <jan.filip...@trivago.com> > > > > > > wrote: > > > > > > > > > > > > > This is not a performance optimisation. Its a fundamental design > > > > > > > choice. > > > > > > > > > > > > > > > > > > > > > I never really took a look how streams does exactly once. (its a > > > > > > > trap > > > > > > > anyways and you usually can deal with at least once donwstream > > > > > > > pretty > > > > > > > easy). But I am very certain its not gonna get somewhere if offset > > > > > > > commit and record produce cluster are not the same. > > > > > > > > > > > > > > Pretty sure without this _design choice_ you can skip on that > > > > > > > exactly > > > > > > > once already > > > > > > > > > > > > > > Best Jan > > > > > > > > > > > > > > On 16.10.2018 18:16, Ryanne Dolan wrote: > > > > > > > > > But one big obstacle in this was > > > > > > > > always that group coordination happened on the source cluster. > > > > > > > > > > > > > > > > Jan, thank you for bringing up this issue with legacy > > > > > > > > MirrorMaker. I > > > > > > > > totally agree with you. This is one of several problems with > > > > > > > > MirrorMaker > > > > > > > > I intend to solve in MM2, and I already have a design and > > > > > > > > prototype that > > > > > > > > solves this and related issues. But as you pointed out, this > > > > > > > > KIP is > > > > > > > > already rather complex, and I want to focus on the core feature > > > > > > > > set > > > > > > > > rather than performance optimizations for now. If we can agree > > > > > > > > on what > > > > > > > > MM2 looks like, it will be very easy to agree to improve its > > > > > > > > performance > > > > > > > > and reliability. > > > > > > > > > > > > > > > > That said, I look forward to your support on a subsequent KIP > > > > > > > > that > > > > > > > > addresses consumer coordination and rebalance issues. Stay > > > > > > > > tuned! > > > > > > > > > > > > > > > > Ryanne > > > > > > > > > > > > > > > > On Tue, Oct 16, 2018 at 6:58 AM Jan Filipiak > > > > > > > > <jan.filip...@trivago.com > > > > > > > > <mailto:jan.filip...@trivago.com>> wrote: > > > > > > > > > > > > > > > > Hi, > > > > > > > > > > > > > > > > Currently MirrorMaker is usually run collocated with the > > > > > > > >target > > > > > > > > cluster. > > > > > > > > This is all nice and good. But one big obstacle in this was > > > > > > > > always that group coordination happened on the source > > > > > > > >cluster. So > > > > > > > when > > > > > > > > then network was congested, you sometimes loose group > > > > > > > >membership and > > > > > > > > have to rebalance and all this. > > > > > > > > > > > > > > > > So one big request from we would be the support of having > > > > > > > coordination > > > > > > > > cluster != source cluster. > > > > > > > > > > > > > > > > I would generally say a LAN is better than a WAN for doing > > > > > > > >group > > > > > > > > coordinaton and there is no reason we couldn't have a group > > > > > > > >consuming > > > > > > > > topics from a different cluster and committing offsets to > > > > > > > >another > > > > > > > > one right? > > > > > > > > > > > > > > > > Other than that. It feels like the KIP has too much > > > > > > > >features where > > > > > > > many > > > > > > > > of them are not really wanted and counter productive but I > > > > > > > >will just > > > > > > > > wait and see how the discussion goes. > > > > > > > > > > > > > > > > Best Jan > > > > > > > > > > > > > > > > > > > > > > > > On 15.10.2018 18:16, Ryanne Dolan wrote: > > > > > > > > > Hey y'all! > > > > > > > > > > > > > > > > > > Please take a look at KIP-382: > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > https://cwiki.apache.org/confluence/display/KAFKA/KIP-382%3A+MirrorMaker+2.0 > > > > > > > > > > > > > > > > > > Thanks for your feedback and support. > > > > > > > > > > > > > > > > > > Ryanne > > > > > > > > > > > > > > > > > > > > > > > >