Chesnay Schepler created FLINK-24887:
----------------------------------------
             Summary: Retrying savepoints may cause early cluster shutdown
                 Key: FLINK-24887
                 URL: https://issues.apache.org/jira/browse/FLINK-24887
             Project: Flink
          Issue Type: Bug
          Components: Runtime / REST
    Affects Versions: 1.15.0
            Reporter: Chesnay Schepler
            Assignee: Chesnay Schepler
             Fix For: 1.15.0

If an operation is retried, we potentially access the result of a previous attempt to see whether it has already failed and, if so, eagerly fail the trigger request. If that attempt has already completed, this may lead to an unexpected shutdown of the cluster.

Beyond this issue, the eager checking of previous attempts makes error handling more complicated, because all cases have to be covered for both the trigger and the status-retrieval operations.
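To illustrate the error-handling concern, here is a minimal sketch of a design in which a retried trigger never inspects the earlier attempt, so that failures surface only through status retrieval. It is not Flink's actual code: `OperationRegistry`, `trigger`, `getStatus`, and the `Status` enum are hypothetical names chosen for this example, and the sketch assumes that an operation is identified by a caller-supplied string key.

```java
import java.util.Map;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Supplier;

/** Hypothetical registry of asynchronous operations; not a Flink class. */
final class OperationRegistry {

    /** Outcome of an operation as seen by a status request. */
    enum Status { IN_PROGRESS, SUCCEEDED, FAILED }

    private final Map<String, CompletableFuture<String>> operations =
            new ConcurrentHashMap<>();

    /**
     * Starts the operation unless one is already registered under the key.
     * A retried trigger returns normally and does not look at the result
     * of the earlier attempt, even if that attempt has already failed.
     */
    void trigger(String operationKey, Supplier<CompletableFuture<String>> operation) {
        operations.computeIfAbsent(operationKey, key -> operation.get());
    }

    /** The only place where the result of an attempt is inspected. */
    Status getStatus(String operationKey) {
        CompletableFuture<String> result = operations.get(operationKey);
        if (result == null) {
            throw new IllegalArgumentException("Unknown operation: " + operationKey);
        }
        if (!result.isDone()) {
            return Status.IN_PROGRESS;
        }
        return result.isCompletedExceptionally() ? Status.FAILED : Status.SUCCEEDED;
    }
}
```

With this split, only the status path needs failure handling; the trigger path has a single outcome regardless of how the previous attempt ended.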