Thanks, Theo!

From: "theo.diefent...@scoop-software.de" <theo.diefent...@scoop-software.de>
Date: Thursday, 13 February 2020 at 12:13 AM
To: "Hegde, Mahendra" <mahendra.he...@arity.com>, "user@flink.apache.org" 
<user@flink.apache.org>
Subject: [External] AW: How Flink Kafka Consumer works when it restarts

Hi Mahendra,

Flink regularly creates checkpoints, and you can also trigger savepoints 
manually. Both are managed and stored by Flink, and both contain the Kafka 
offsets.

When restarting, you can configure the job to restore from the last checkpoint 
or from a savepoint.

You can additionally configure Flink to commit the offsets to Kafka; this, 
again, happens on checkpoints only. If you don't let Flink restart from an 
existing checkpoint or savepoint (which is where it would look first to restore 
the offsets), you can configure it to start from those committed offsets.

Once the offsets are loaded, whether from a checkpoint, a savepoint, or Kafka, 
Flink communicates directly with Kafka and asks it to deliver messages starting 
from those offsets.
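
To make the order of precedence concrete, here is a minimal plain-Java sketch 
of the decision described above. It is only a model, not Flink code: the class 
`StartOffsetResolver`, its `resolve` method, and the `Source` enum are 
hypothetical names invented for illustration, and each offset is simply treated 
as "the next position to read" for a single partition. As far as I know, in the 
`FlinkKafkaConsumer` API the relevant switches are 
`setCommitOffsetsOnCheckpoints(true)` and `setStartFromGroupOffsets()`, with 
Kafka's `auto.offset.reset` as the fallback when no committed offset exists.

```java
public class StartOffsetResolver {

    /** Where the chosen start position came from. */
    public enum Source { FLINK_STATE, KAFKA_COMMITTED, RESET_POLICY }

    /** The resolved start position together with its origin. */
    public static final class Decision {
        public final Source source;
        public final long offset;

        Decision(Source source, long offset) {
            this.source = source;
            this.offset = offset;
        }

        @Override
        public String toString() {
            return source + " -> " + offset;
        }
    }

    /**
     * Picks the start offset for one partition (hypothetical model).
     *
     * @param restoredOffset  offset restored from a checkpoint or savepoint,
     *                        or null if the job starts without restored state
     * @param committedOffset offset committed to Kafka for the consumer group,
     *                        or null if nothing was committed
     * @param resetOffset     position chosen by the reset policy (assumed to
     *                        be already resolved to a concrete offset)
     */
    public static Decision resolve(Long restoredOffset, Long committedOffset, long resetOffset) {
        // 1. State restored from a checkpoint or savepoint always wins.
        if (restoredOffset != null) {
            return new Decision(Source.FLINK_STATE, restoredOffset);
        }
        // 2. Otherwise fall back to the offset committed to Kafka, if any.
        if (committedOffset != null) {
            return new Decision(Source.KAFKA_COMMITTED, committedOffset);
        }
        // 3. With neither available, the reset policy decides.
        return new Decision(Source.RESET_POLICY, resetOffset);
    }

    public static void main(String[] args) {
        // Restart from a checkpoint: the committed offset in Kafka is ignored.
        System.out.println(resolve(120L, 100L, 0L));
        // Fresh start without state: the committed group offset is used.
        System.out.println(resolve(null, 100L, 0L));
        // Fresh start, nothing committed: the reset policy applies.
        System.out.println(resolve(null, null, 0L));
    }
}
```

The point of the model is that the offsets committed to Kafka are a fallback 
(and useful for monitoring), while the offsets in Flink's own state are the 
source of truth after a restore.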

Best regards
Theo


Sent from my Huawei phone


-------- Original message --------
From: "Hegde, Mahendra" <mahendra.he...@arity.com>
Date: Wed, 12 Feb 2020, 17:50
To: user@flink.apache.org
Subject: How Flink Kafka Consumer works when it restarts

Hi All,

I am a bit confused about how the Flink Kafka consumer works.
I read that Flink stores the Kafka message offsets in its checkpoints and uses 
them when it restarts.

My question is: when exactly does Flink confirm successful consumption (commit 
offsets) to the Kafka broker?
And when a Flink job restarts, does it send the last offset available in the 
checkpoint to the Kafka broker and start consuming from that point?
Or does the Kafka broker resume based on the last committed offset information 
it has available?
(I mean, who manages the actual offset here: the Kafka broker or the Flink 
client?)

Thanks
Mahendra
