GitHub user tillrohrmann opened a pull request: https://github.com/apache/flink/pull/3183
[backport] [FLINK-5214] [FLINK-5229] Backport StreamTask and StreamOperator checkpoint cleanup Backport of #3179 and #3178 onto the `release-1.2` branch. Adds exception handling to the stream operators for the snapshotState method. A failing snapshot operation will trigger the clean up of all so far generated state resources. This will avoid that in case of a failing snapshot operation resources (e.g. files) are left behind. This PR adds operator state cleanup to the StreamTask class. If a stream task contains multiple stream operators, then every operator is checkpointed. In case that a snapshot operation fails all state handles and OperatorSnapshotResults belonging to previous operators have to be freed. You can merge this pull request into a Git repository by running: $ git pull https://github.com/tillrohrmann/flink backportStateCleanup Alternatively you can review and apply these changes as the patch at: https://github.com/apache/flink/pull/3183.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #3183 ---- commit a0edef186c49520e353fe6cdc3321ef208e1bb3b Author: Till Rohrmann <trohrm...@apache.org> Date: 2016-12-01T12:25:05Z [FLINK-5214] Clean up checkpoint data in case of a failing checkpoint operation Adds exception handling to the stream operators for the snapshotState method. A failing snapshot operation will trigger the clean up of all so far generated state resources. This will avoid that in case of a failing snapshot operation resources (e.g. files) are left behind. Add test case for OperatorSnapshotResult Add StateSnapshotContextSynchronousImplTest Add AbstractStreamOperator failing snapshot tests commit 5eb4c2ff00a3818c53bac6c440d83bff0be8501a Author: Till Rohrmann <trohrm...@apache.org> Date: 2017-01-20T13:28:44Z [FLINK-5229] [state] Cleanup of operator snapshots if subsequent operator snapshots fail This PR adds operator state cleanup to the StreamTask class. If a stream task contains multiple stream operators, then every operator is checkpointed. In case that a snapshot operation fails all state handles and OperatorSnapshotResults belonging to previous operators have to be freed. Add test cases for failing checkpoint operations in StreamTask ---- --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---