Jan, thanks for the share. Also similar are Pulsar's concepts of namespaces and global topics. I don't think these need to be supported in Kafka itself, but there are many benefits to adopting naming conventions along these lines, esp for tooling, dashboards etc.
> use it to copy my messages from A to B MM2 is a drop-in replacement for MirrorMaker and allows the new features to be disabled if you just want to blindly copy records. Ryanne On Thu, Oct 18, 2018, 2:28 AM Jan Filipiak <jan.filip...@trivago.com> wrote: > then I just hope that in the midsts of all this new features I can still > at least use it to copy my messages from A to B later. > > Another hint you should be aware of: > > https://cwiki.apache.org/confluence/display/KAFKA/Hierarchical+Topics > > That was always a design I admired, with active / active replications > and stuff, It feels like we are going away from this another step. > > On 17.10.2018 17:34, Ryanne Dolan wrote: > > Jan, these are two separate issues. > > > > 1) consumer coordination should not, ideally, involve unreliable or slow > > connections. Naively, a KafkaSourceConnector would coordinate via the > > source cluster. We can do better than this, but I'm deferring this > > optimization for now. > > > > 2) exactly-once between two clusters is mind-bending. But keep in mind > > that transactions are managed by the producer, not the consumer. In > > fact, it's the producer that requests that offsets be committed for the > > current transaction. Obviously, these offsets are committed in whatever > > cluster the producer is sending to. > > > > These two issues are closely related. They are both resolved by not > > coordinating or committing via the source cluster. And in fact, this is > > the general model of SourceConnectors anyway, since most > > SourceConnectors _only_ have a destination cluster. > > > > If there is a lot of interest here, I can expound further on this aspect > > of MM2, but again I think this is premature until this first KIP is > > approved. I intend to address each of these in separate KIPs following > > this one. > > > > Ryanne > > > > On Wed, Oct 17, 2018 at 7:09 AM Jan Filipiak <jan.filip...@trivago.com > > <mailto:jan.filip...@trivago.com>> wrote: > > > > This is not a performance optimisation. Its a fundamental design > choice. > > > > > > I never really took a look how streams does exactly once. (its a trap > > anyways and you usually can deal with at least once donwstream pretty > > easy). But I am very certain its not gonna get somewhere if offset > > commit and record produce cluster are not the same. > > > > Pretty sure without this _design choice_ you can skip on that exactly > > once already > > > > Best Jan > > > > On 16.10.2018 18:16, Ryanne Dolan wrote: > > > > But one big obstacle in this was > > > always that group coordination happened on the source cluster. > > > > > > Jan, thank you for bringing up this issue with legacy > MirrorMaker. I > > > totally agree with you. This is one of several problems with > > MirrorMaker > > > I intend to solve in MM2, and I already have a design and > > prototype that > > > solves this and related issues. But as you pointed out, this KIP > is > > > already rather complex, and I want to focus on the core feature > set > > > rather than performance optimizations for now. If we can agree on > > what > > > MM2 looks like, it will be very easy to agree to improve its > > performance > > > and reliability. > > > > > > That said, I look forward to your support on a subsequent KIP that > > > addresses consumer coordination and rebalance issues. Stay tuned! > > > > > > Ryanne > > > > > > On Tue, Oct 16, 2018 at 6:58 AM Jan Filipiak > > <jan.filip...@trivago.com <mailto:jan.filip...@trivago.com> > > > <mailto:jan.filip...@trivago.com > > <mailto:jan.filip...@trivago.com>>> wrote: > > > > > > Hi, > > > > > > Currently MirrorMaker is usually run collocated with the > target > > > cluster. > > > This is all nice and good. But one big obstacle in this was > > > always that group coordination happened on the source > > cluster. So when > > > then network was congested, you sometimes loose group > > membership and > > > have to rebalance and all this. > > > > > > So one big request from we would be the support of having > > coordination > > > cluster != source cluster. > > > > > > I would generally say a LAN is better than a WAN for doing > group > > > coordinaton and there is no reason we couldn't have a group > > consuming > > > topics from a different cluster and committing offsets to > another > > > one right? > > > > > > Other than that. It feels like the KIP has too much features > > where many > > > of them are not really wanted and counter productive but I > > will just > > > wait and see how the discussion goes. > > > > > > Best Jan > > > > > > > > > On 15.10.2018 18:16, Ryanne Dolan wrote: > > > > Hey y'all! > > > > > > > > Please take a look at KIP-382: > > > > > > > > > > > > > > https://cwiki.apache.org/confluence/display/KAFKA/KIP-382%3A+MirrorMaker+2.0 > > > > > > > > Thanks for your feedback and support. > > > > > > > > Ryanne > > > > > > > > > >