Till Rohrmann created FLINK-9324:
------------------------------------

             Summary: SingleLogicalSlot returns completed release future before slot is properly returned
                 Key: FLINK-9324
                 URL: https://issues.apache.org/jira/browse/FLINK-9324
             Project: Flink
          Issue Type: Bug
          Components: Distributed Coordination
    Affects Versions: 1.5.0, 1.6.0
            Reporter: Till Rohrmann
            Assignee: Till Rohrmann
             Fix For: 1.5.0

The {{SingleLogicalSlot#releaseSlot}} method returns a future which is supposed to be completed once the slot has been returned to the {{SlotOwner}}. Unfortunately, we don't wait for the {{SlotOwner}}'s response before completing the future; instead, we complete it directly after the call has been made.

As a result, the {{ExecutionGraph}} can be restarted during a recovery before all of its slots have been returned to the {{SlotPool}}. As a consequence, the allocation of the new tasks might require more slots than the max parallelism because of collisions with old tasks (in case of slot sharing).
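The difference between the two behaviours can be illustrated with a minimal, self-contained sketch. This is not the Flink code: the {{SlotOwner}} interface below is a simplified stand-in, and {{ReleaseFutureSketch}}, {{releaseEagerly}} and {{releaseAfterAck}} are hypothetical names chosen only for this illustration. It assumes the owner answers asynchronously with a future, and shows one possible way to tie the release future to that answer.

```java
import java.util.concurrent.CompletableFuture;

public class ReleaseFutureSketch {

    /** Hypothetical, simplified stand-in for the SlotOwner: it answers asynchronously. */
    interface SlotOwner {
        CompletableFuture<Boolean> returnAllocatedSlot(String slotId);
    }

    /** Behaviour described in this issue: the future is completed right after the call. */
    static CompletableFuture<Void> releaseEagerly(SlotOwner owner, String slotId) {
        owner.returnAllocatedSlot(slotId); // the owner's response is ignored
        return CompletableFuture.completedFuture(null);
    }

    /** One possible fix: complete the release future only once the owner has answered. */
    static CompletableFuture<Void> releaseAfterAck(SlotOwner owner, String slotId) {
        return owner.returnAllocatedSlot(slotId).thenAccept(ack -> { });
    }

    public static void main(String[] args) {
        // The owner's answer stays pending until we complete it by hand.
        CompletableFuture<Boolean> pendingAck = new CompletableFuture<>();
        SlotOwner owner = slotId -> pendingAck;

        CompletableFuture<Void> eager = releaseEagerly(owner, "slot-1");
        CompletableFuture<Void> chained = releaseAfterAck(owner, "slot-1");

        System.out.println("eager done before ack:   " + eager.isDone());   // true
        System.out.println("chained done before ack: " + chained.isDone()); // false

        pendingAck.complete(true); // the owner finally confirms the return
        System.out.println("chained done after ack:  " + chained.isDone()); // true
    }
}
```

In the eager variant, a caller that waits on the release future before restarting would proceed while the slot is still in flight, which is the race described above. In the chained variant the caller cannot proceed until the owner has confirmed the return.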