Stephan Ewen created FLINK-5230:
-----------------------------------

             Summary: Safety nets against leaving dysfunctional JobManagers
                 Key: FLINK-5230
                 URL: https://issues.apache.org/jira/browse/FLINK-5230
             Project: Flink
          Issue Type: Improvement
          Components: Distributed Coordination
            Reporter: Stephan Ewen

There are certain ways that a {{JobManager}} can become dysfunctional. If the JobManager process continues to exist (not restarted by YARN / Mesos, etc.) but is no longer doing its work properly, it makes the streaming job unavailable.

There are some safety nets to put in place for that; see the sub-issues.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)