Aitozi created FLINK-28531:
------------------------------

             Summary: Shutdown cluster after history server archive finished
                 Key: FLINK-28531
                 URL: https://issues.apache.org/jira/browse/FLINK-28531
             Project: Flink
          Issue Type: Bug
          Components: Runtime / Coordination
            Reporter: Aitozi

I ran into a problem where the job cluster may be shut down before the history server archive file upload has finished.

After some research, it seems to be caused by two things.

First, {{HistoryServerArchivist#archiveExecutionGraph}} is not waited on to complete.

Second, {{KubernetesResourceManagerDriver#deregisterApplication}} directly removes the deployment.

So the shutdown flow in {{ClusterEntrypoint}} first triggers the deletion of the deployment, which causes the master pod to be deleted while some operations/futures have not yet finished.
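
The ordering that would avoid this can be sketched roughly as follows. This is only an illustration under assumptions, not the actual Flink code: the class {{ShutdownOrderSketch}} and the methods {{archive}}, {{deregister}} and {{runShutdown}} are hypothetical stand-ins for the archive upload and the deployment deletion. The point is that the deregistration step is chained onto the archive future instead of running independently of it.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.concurrent.CompletableFuture;

public class ShutdownOrderSketch {

    // Hypothetical stand-in for the asynchronous archive upload.
    static CompletableFuture<Void> archive(List<String> events) {
        return CompletableFuture.runAsync(() -> {
            try {
                Thread.sleep(100); // simulate a slow upload
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
            events.add("archive-finished");
        });
    }

    // Hypothetical stand-in for the step that deletes the deployment.
    static void deregister(List<String> events) {
        events.add("deployment-deleted");
    }

    // Runs the deregistration only after the archive future has completed,
    // then blocks until the whole chain is done and returns the event order.
    static List<String> runShutdown() {
        List<String> events = Collections.synchronizedList(new ArrayList<>());
        archive(events)
                .thenRun(() -> deregister(events))
                .join();
        return events;
    }

    public static void main(String[] args) {
        System.out.println(runShutdown());
    }
}
```

With this chaining the deletion step cannot start until the upload step has recorded its completion. How a failed archive should be handled (for example, whether shutdown should proceed anyway) is left open here.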