Yang Wang created FLINK-25865: --------------------------------- Summary: Support to set restart policy of TaskManager pod for native K8s integration Key: FLINK-25865 URL: https://issues.apache.org/jira/browse/FLINK-25865 Project: Flink Issue Type: Improvement Components: Deployment / Kubernetes Reporter: Yang Wang
After FLIP-201, Flink's TaskManagers will be able to be restarted without losing its local state. So it is reasonable to make the restart policy[1] of TaskManager pod could be configured. The current restart policy is {{{}Never{}}}. Flink will always delete the failed TaskManager pod directly and create a new one instead. This ticket could help to decrease the recovery time of TaskManager failure. Please note that the working directory needs to be located in the emptyDir[1], which is retained in different restarts. [1]. https://kubernetes.io/docs/concepts/workloads/pods/pod-lifecycle/#restart-policy [2]. https://kubernetes.io/docs/concepts/storage/volumes/#emptydir -- This message was sent by Atlassian Jira (v8.20.1#820001)