zhijiang created FLINK-6120: ------------------------------- Summary: Implement heartbeat logic between JobMaster and ResourceManager Key: FLINK-6120 URL: https://issues.apache.org/jira/browse/FLINK-6120 Project: Flink Issue Type: Improvement Reporter: zhijiang Assignee: zhijiang
It is part of work for Flip-6. The HeartbeatManager is mainly used for monitoring heartbeat target and reporting payloads. For {{ResourceManager}} side, it would trigger monitoring the {{HeartbeatTarget}} when receive registration from {{JobMaster}}, and schedule a task to {{requestHeartbeat}} at interval time. If not receive heartbeat response within duration time, the {{HeartbeatListener}} will notify heartbeat timeout, then the {{ResourceManager}} should remove the internal registered {{JobMaster}}. For {{JobMaster}} side, it would trigger monitoring the {{HeartbeatTarget}} when receive registration acknowledgement from {{ResourceManager}}. An it will also be notified heartbeat timeout if not receive heartbeat request from {{ResourceManager}} within duration time. The current implementation will not interact payloads via heartbeat, and it can be added if needed future. -- This message was sent by Atlassian JIRA (v6.3.15#6346)