Any help here please.

Regards,
Kartik


On Fri, Feb 9, 2024, 8:33 AM Kartik Kushwaha <kushwaha.karti...@gmail.com>
wrote:

> I am using flink checkpointing to restore states of my job. I am using
> unaligned checkpointing with 100 ms as the checkpointing interval. I see
> few events getting dropped that were sucessfully processed by the operators
> or were in-flight that were yet to be captured by checkpoint. That is these
> were new events which came into the pipeline between the previously
> captured checkpoint state and the failure.
> <https://stackoverflow.com/posts/77955164/timeline>
>
> My project acknowledges(commits) back to the topic after the event read
> and mongo ingestion. But the pipeline has transformation, enrichment and
> sink operators after that. These missing events were read, ack'd back to
> the topic and transformed successfully before failure and were not yet
> checkpointed (withing the 100 ms interval between checkpoints) were dropped.
>
> Pipeline: Source (solace topic, queue reader) --> [MongoWrite +
> sourcecommit] --> transform --> enrich --> sink (solace topic)
>
> Checkpoint: Unaligned, Exactly-once, 100ms interval, 10ms Min pause
> between checkpoint
>
>    1.
>
>    I see that whenever runtime exception is thrown it triggers the close
>    method in each of the functions one by one. Do we have to store the states
>    which were not yet captured by the checkpoint before failure? What happens
>    Network failures or task manager crash or any other abrupt failure?
>    2.
>
>    Or do we have to shift the source topic acknowledgment to the last (
>    but we will have to chain all this operators to run in a single thread and
>    carry the bytearray message object from solace queue to do ack at the end).
>
> Is there anything else Iam missing here?
>
>
> Note: Sources and Sinks are fully Solace based in all the Flink pipelines
> ( queues and topics)
>
>
>

Reply via email to