[ https://issues.apache.org/jira/browse/FLINK-24633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17438711#comment-17438711 ]
Aitozi commented on FLINK-24633:
--------------------------------

The pod may get stuck in several cases:
1. IP allocation times out
2. Volume mount fails
3. Sandbox start fails
...

I think this can be solved by an external operator that monitors the lifecycle of all the JobMasters on the cluster.

> JobManager pod may stuck in containerCreating status during failover
> --------------------------------------------------------------------
>
>                 Key: FLINK-24633
>                 URL: https://issues.apache.org/jira/browse/FLINK-24633
>             Project: Flink
>          Issue Type: Bug
>          Components: Deployment / Kubernetes
>    Affects Versions: 1.14.0
>            Reporter: Aitozi
>            Priority: Major
>
>

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
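
As a rough illustration of the check such an external operator might run, here is a minimal sketch. It assumes pod objects are already available as dicts shaped like the Kubernetes API JSON (`metadata.creationTimestamp`, `status.phase`, `status.containerStatuses[].state.waiting.reason`). The function names, the 5-minute timeout, and the choice to key only on the `ContainerCreating` waiting reason are assumptions made for this example, not anything proposed in the issue.

```python
from datetime import datetime, timedelta, timezone

# Waiting reasons treated as "stuck" once the timeout has passed (assumption).
STUCK_REASONS = {"ContainerCreating"}


def parse_ts(ts):
    """Parse a Kubernetes-style UTC timestamp such as 2021-11-04T11:50:00Z."""
    return datetime.strptime(ts, "%Y-%m-%dT%H:%M:%SZ").replace(tzinfo=timezone.utc)


def is_stuck(pod, now, timeout=timedelta(minutes=5)):
    """Return True if the pod has been waiting in a stuck reason longer than timeout."""
    status = pod.get("status", {})
    # A pod whose containers are still being created reports phase Pending.
    if status.get("phase") != "Pending":
        return False
    created = parse_ts(pod["metadata"]["creationTimestamp"])
    if now - created < timeout:
        return False
    for cs in status.get("containerStatuses", []):
        waiting = cs.get("state", {}).get("waiting")
        if waiting and waiting.get("reason") in STUCK_REASONS:
            return True
    return False


def find_stuck_pods(pods, now, timeout=timedelta(minutes=5)):
    """Return the names of all pods considered stuck at time `now`."""
    return [p["metadata"]["name"] for p in pods if is_stuck(p, now, timeout)]
```

What the operator does with the resulting names (for example, deleting the pod so that its controller recreates it) is a policy decision that the comment above does not specify.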