Hi Everyone,

since kafka 0.11.x supports exactly-once semantics, I want to be sure,
that it is possible to achieve it across kafka clusters using MirrorMaker.

We have got two locations with "primary" cluster in each location and
for each location we have got one "aggregation" cluster which mirrors
data from all primary clusters.

Currently we deduplicate messages when we copying data from aggregation
kafka to HDFS by separete YARN application. But in aggregation kafka
duplicates remains. So I want to ensure that there are no duplicates and
loss data in kafka and not to use our deduplication yarn application
anymore.

If it is possible, how to configure MirrorMaker to achieve exactly-once
delivery across primary and aggregation clusters?


Thanks and have a nice day, Jiri Humpolicek



Je dobré vědět, že tento e-mail a přílohy jsou důvěrné. Pokud spolu jednáme o 
uzavření obchodu, vyhrazujeme si právo naše jednání kdykoli ukončit. Pro fanoušky 
právní mluvy - vylučujeme tím ustanovení občanského zákoníku o předsmluvní 
odpovědnosti. Pravidla o tom, kdo u nás a jak vystupuje za společnost a kdo může co a 
jak podepsat naleznete zde<https://onas.seznam.cz/cz/podpisovy-rad-cz.html>

You should know that this e-mail and its attachments are confidential. If we are 
negotiating on the conclusion of a transaction, we reserve the right to terminate the 
negotiations at any time. For fans of legalese—we hereby exclude the provisions of 
the Civil Code on pre-contractual liability. The rules about who and how may act for 
the company and what are the signing procedures can be found 
here<https://onas.seznam.cz/cz/signature-rules.html>.

Reply via email to