[jira] [Created] (FLINK-24887) Retrying savepoints may cause early cluster shutdown

Chesnay Schepler (Jira) Fri, 12 Nov 2021 03:12:04 -0800

Chesnay Schepler created FLINK-24887:
----------------------------------------


             Summary: Retrying savepoints may cause early cluster shutdown
                 Key: FLINK-24887
                 URL: https://issues.apache.org/jira/browse/FLINK-24887
             Project: Flink
          Issue Type: Bug
          Components: Runtime / REST
    Affects Versions: 1.15.0
            Reporter: Chesnay Schepler
            Assignee: Chesnay Schepler
             Fix For: 1.15.0


If an operation is retried we potentially access the result of a previous 
attempt to see if it has already failed and eagerly fail the trigger request. 
If that attempt is already complete then this may lead to an unexpected 
shutdown of the cluster.

Beyond this issue, the eager checking of previous attempts makes error handling 
more complicated, because you have to cover all cases for both the trigger and 
status-retrieval operations.





--
This message was sent by Atlassian Jira
(v8.20.1#820001)

[jira] [Created] (FLINK-24887) Retrying savepoints may cause early cluster shutdown

Reply via email to