Yingjie Cao created FLINK-28380:
-----------------------------------

             Summary: Produce one intermediate dataset for multiple consumers consuming the same data
                 Key: FLINK-28380
                 URL: https://issues.apache.org/jira/browse/FLINK-28380
             Project: Flink
          Issue Type: Sub-task
          Components: Client / Job Submission, Runtime / Coordination, Runtime / Network
            Reporter: Yingjie Cao
             Fix For: 1.16.0

Currently, if one output of an upstream job vertex is consumed by multiple downstream job vertices, the upstream vertex produces multiple datasets, one per consumer. For blocking shuffle, this means the same data is serialized and persisted multiple times. This ticket aims to optimize this behavior so that the upstream job vertex produces a single dataset that is read by all of the downstream vertices.
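
To make the cost difference concrete, here is a minimal toy sketch in Python. It is not Flink code: `BlockingStore`, `produce_per_consumer`, and `produce_shared` are hypothetical names invented for illustration, and `pickle` merely stands in for whatever serialization the runtime uses. It only counts how many times the same records are serialized and persisted under each scheme.

```python
import pickle
from dataclasses import dataclass, field


@dataclass
class BlockingStore:
    """Hypothetical stand-in for persisted blocking-shuffle storage."""
    datasets: dict = field(default_factory=dict)
    serialize_calls: int = 0

    def persist(self, dataset_id, records):
        # Each call serializes and stores one full copy of the records.
        self.serialize_calls += 1
        self.datasets[dataset_id] = pickle.dumps(records)

    def read(self, dataset_id):
        return pickle.loads(self.datasets[dataset_id])


def produce_per_consumer(store, records, consumers):
    """Behavior described as current: one dataset per downstream consumer."""
    mapping = {}
    for consumer in consumers:
        dataset_id = f"dataset-for-{consumer}"
        store.persist(dataset_id, records)
        mapping[consumer] = dataset_id
    return mapping


def produce_shared(store, records, consumers):
    """Behavior the ticket proposes: one dataset read by every consumer."""
    dataset_id = "shared-dataset"
    store.persist(dataset_id, records)
    return {consumer: dataset_id for consumer in consumers}
```

In this model, per-consumer production with N consumers calls `persist` N times, while the shared variant calls it once and every consumer still reads identical records.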