Weihua Hu created FLINK-31498: --------------------------------- Summary: DeclartiveSlotManager always request redundant task manager when resource is not enough Key: FLINK-31498 URL: https://issues.apache.org/jira/browse/FLINK-31498 Project: Flink Issue Type: Bug Components: Runtime / Coordination Reporter: Weihua Hu Attachments: image-2023-03-17-18-05-43-088.png
Currently redundant task manager check in DeclarativeSlotManager only compare free slots with required redundant slots. when there are no enough resources in YARN/Kubernetes, this mechanism will always try to request new task manager. there are two way to address this. 1. maintain the state of redundant workers to avoid request twice 2. only try to request redundant workers when there is no pending worker The first way will make the logic of redundant worker too complicated, I would like to choose the second way Looking forward to any suggestion. !image-2023-03-17-18-05-43-088.png! -- This message was sent by Atlassian Jira (v8.20.10#820010)