Thomas Weise created FLINK-29109:
------------------------------------

             Summary: Checkpoint path conflict with stateless upgrade mode
                 Key: FLINK-29109
                 URL: https://issues.apache.org/jira/browse/FLINK-29109
             Project: Flink
          Issue Type: Bug
          Components: Kubernetes Operator
    Affects Versions: kubernetes-operator-1.1.0
            Reporter: Thomas Weise
            Assignee: Thomas Weise


A stateful job with stateless upgrade mode (yes, there are such use cases) 
fails with checkpoint path conflict due to constant jobId and FLINK-19358 
(applies to Flink < 1.16x). Since with stateless upgrade mode the checkpoint id 
resets on restart the job is going to write to previously used locations and 
fail. The workaround is to rotate the jobId on every redeploy when the upgrade 
mode is stateless. While this can be worked around externally it is best done 
in the operator itself because reconciliation resolves when a restart is 
actually required while rotating jobId externally may trigger unnecessary 
restarts.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to