Till Rohrmann created FLINK-11400: ------------------------------------- Summary: JobManagerRunner does not wait for suspension of JobMaster Key: FLINK-11400 URL: https://issues.apache.org/jira/browse/FLINK-11400 Project: Flink Issue Type: Bug Components: Distributed Coordination Affects Versions: 1.7.1, 1.6.3, 1.8.0 Reporter: Till Rohrmann Assignee: Till Rohrmann Fix For: 1.8.0
The {{JobManagerRunner}} does not wait for the suspension of the {{JobMaster}} to finish before granting leadership again. This can lead to a state where the {{JobMaster}} tries to start the {{ExecutionGraph}} but the {{SlotPool}} is still stopped. I suggest to linearize the leadership operations (granting and revoking leadership) similarly to the {{Dispatcher}}. -- This message was sent by Atlassian JIRA (v7.6.3#76005)