[ 
https://issues.apache.org/jira/browse/FLINK-36455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17887987#comment-17887987
 ] 

Arvid Heise commented on FLINK-36455:
-------------------------------------

I'm not an expert on the SFS, but from cross-checking the code it looks safe 
because it has no retries whatsoever. So it would fail on a transient error and 
trigger a new final checkpoint, which is in accordance with the contract of 
notifyCheckpointCompleted.

> Sink should commit everything on notifyCheckpointCompleted
> ----------------------------------------------------------
>
>                 Key: FLINK-36455
>                 URL: https://issues.apache.org/jira/browse/FLINK-36455
>             Project: Flink
>          Issue Type: Bug
>          Components: API / Core
>            Reporter: Arvid Heise
>            Assignee: Arvid Heise
>            Priority: Major
>             Fix For: 2.0-preview
>
>
> Currently, we retry committables at some later time until they eventually 
> succeed.
> However, that violates the contract of notifyCheckpointCompleted, which states 
> that all side effects must be committed before the method returns. In 
> particular, notifyCheckpointCompleted must fail if we cannot guarantee that 
> all side effects are committed for final checkpoints. As soon as 
> notifyCheckpointCompleted returns, the final checkpoint is deemed completed, 
> which currently may mean that some transactions are still open.
> The solution is that all retries must happen in a closed loop in 
> notifyCheckpointCompleted.
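
To illustrate the proposed behaviour, here is a minimal sketch of such a blocking retry loop. It is not Flink's actual sink code: SketchCommitter, BlockingCommitLoop, commitAll and the maxAttempts parameter are hypothetical names chosen for this example, and a bounded number of attempts is just one assumed way to decide when to give up. The point it shows is that the call either returns with every committable committed or throws, so nothing is left to be retried later.

```java
import java.util.List;

// Hypothetical stand-in for a sink committer; NOT Flink's Committer interface.
interface SketchCommitter<T> {
    // Returns true if the committable was committed, false on a retriable failure.
    boolean tryCommit(T committable);
}

final class BlockingCommitLoop {
    private BlockingCommitLoop() {}

    // Commits every committable before returning, retrying in place.
    // Throws if any committable is still uncommitted after maxAttempts tries,
    // so the caller (e.g. a checkpoint-completion callback) fails instead of
    // returning with open transactions. Returns the total number of attempts.
    static <T> int commitAll(List<T> committables,
                             SketchCommitter<T> committer,
                             int maxAttempts) {
        int totalAttempts = 0;
        for (T committable : committables) {
            boolean committed = false;
            for (int attempt = 1; attempt <= maxAttempts && !committed; attempt++) {
                totalAttempts++;
                committed = committer.tryCommit(committable);
            }
            if (!committed) {
                throw new IllegalStateException(
                        "Could not commit " + committable
                                + " after " + maxAttempts + " attempts");
            }
        }
        return totalAttempts;
    }
}
```

In this sketch, a checkpoint-completion callback would call commitAll and let the exception propagate, which would fail the checkpoint notification rather than deferring the retry.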



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
