[ https://issues.apache.org/jira/browse/FLINK-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16217699#comment-16217699 ]
Jing Fan commented on FLINK-4808: --------------------------------- [~ram_krish][~StephanEwen][~till.rohrmann] Do we have any follow up on this problem possible eta? Unable to skip failed checkpoint is blocking migrating critical jobs on to flink platform. We can also contribute to solve this problem if needed. > Allow skipping failed checkpoints > --------------------------------- > > Key: FLINK-4808 > URL: https://issues.apache.org/jira/browse/FLINK-4808 > Project: Flink > Issue Type: New Feature > Components: State Backends, Checkpointing > Affects Versions: 1.1.2, 1.1.3 > Reporter: Stephan Ewen > Fix For: 1.4.0 > > > Currently, if Flink cannot complete a checkpoint, it results in a failure and > recovery. > To make the impact of less stable storage infrastructure on the performance > of Flink less severe, Flink should be able to tolerate a certain number of > failed checkpoints and simply keep executing. > This should be controllable via a parameter, for example: > {code} > env.getCheckpointConfig().setAllowedFailedCheckpoints(3); > {code} > A value of {{-1}} could indicate an infinite number of checkpoint failures > tolerated by Flink. > The default value should still be {{0}}, to keep compatibility with the > existing behavior. -- This message was sent by Atlassian JIRA (v6.4.14#64029)