Hi Everyone,

Since Kafka 0.11.x supports exactly-once semantics, I would like to know
whether it is possible to achieve it across Kafka clusters using MirrorMaker.

We have two locations. Each location has a "primary" cluster and an
"aggregation" cluster, and each aggregation cluster mirrors data from
all primary clusters.

Currently we deduplicate messages when copying data from the aggregation
Kafka cluster to HDFS with a separate YARN application, but the duplicates
remain in the aggregation cluster itself. I want to ensure that there are
no duplicates and no data loss in Kafka, so that we no longer need our
deduplication YARN application.

If it is possible, how should MirrorMaker be configured to achieve
exactly-once delivery between the primary and aggregation clusters?

Thanks and have a nice day, Jiri Humpolicek

