[ https://issues.apache.org/jira/browse/FLINK-10462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
zhijiang updated FLINK-10462: ----------------------------- Description: Every {{IntermediateResult}} generates a random {{ConnectionIndex}} which will be included in {{ConnectionID}}. The {{RemoteInputChannel}} requests to establish tcp connection via {{ConnectionID}}. That means one tcp connection may be shared by multiple {{RemoteInputChannel}} {{s which have the same ConnectionID}}. To do so, we can reduce the physical connections between two {{TaskManager}}s, and it brings benefits for large scale jobs. But this sharing is limited only for the same {{IntermediateResult}}, and I think it is mainly because we may temporarily switch off {{autoread}} for the channel during back pressure in previous network flow control. For credit-based mode, the channel is always open for transporting different intermediate data, so we can further share the tcp connection for different {{IntermediateResults}} to remove the limit. was: Every {{IntermediateResult}} generates a random {{ConnectionIndex}} which will be included in {{ConnectionID}}. The {{RemoteInputChannel}} requests to establish tcp connection via {{ConnectionID}}. That means one tcp connection may be shared by multiple {{RemoteInputChannel}} {{s which have the same ConnectionID}}. To do so, we can reduce the physical connections between two {{TaskManager}} s, and it brings benefits for large scale jobs. But this sharing is limited only for the same {{IntermediateResult}}, and I think it is mainly because we may temporarily switch off {{autoread}} for the channel during back pressure in previous network flow control. For credit-based mode, the channel is always open for transporting different intermediate data, so we can further share the tcp connection for different {{IntermediateResults}} to remove the limit. > Remove ConnectionIndex for further sharing tcp connection in credit-based > mode > ------------------------------------------------------------------------------- > > Key: FLINK-10462 > URL: https://issues.apache.org/jira/browse/FLINK-10462 > Project: Flink > Issue Type: Improvement > Components: Network > Affects Versions: 1.5.0, 1.5.1, 1.5.2, 1.5.3, 1.6.0, 1.6.1, 1.5.4 > Reporter: zhijiang > Assignee: zhijiang > Priority: Minor > > Every {{IntermediateResult}} generates a random {{ConnectionIndex}} which > will be included in {{ConnectionID}}. > The {{RemoteInputChannel}} requests to establish tcp connection via > {{ConnectionID}}. That means one tcp connection may be shared by multiple > {{RemoteInputChannel}} {{s which have the same ConnectionID}}. To do so, we > can reduce the physical connections between two {{TaskManager}}s, and it > brings benefits for large scale jobs. > But this sharing is limited only for the same {{IntermediateResult}}, and I > think it is mainly because we may temporarily switch off {{autoread}} for the > channel during back pressure in previous network flow control. For > credit-based mode, the channel is always open for transporting different > intermediate data, so we can further share the tcp connection for different > {{IntermediateResults}} to remove the limit. -- This message was sent by Atlassian JIRA (v7.6.3#76005)