Verifying correctness of StreamingFileSink (Kafka -> S3)

amran dean Tue, 15 Oct 2019 19:01:13 -0700

I am evaluating StreamingFileSink (Kafka 0.10.11) as a production-ready
alternative to a current Kafka -> S3 solution.


Is there any way to verify the integrity of data written in S3? I'm
confused how the file names (e.g part-1-17) map to Kafka partitions, and
further unsure how to ensure that no Kafka records are lost (I know Flink
guarantees exactly-once, but this is more of a sanity check).

Verifying correctness of StreamingFileSink (Kafka -> S3)

Reply via email to