Lijie Wang created FLINK-31079:
----------------------------------

             Summary: Release Testing: Verify FLINK-29663 Further improvements 
of adaptive batch scheduler
                 Key: FLINK-31079
                 URL: https://issues.apache.org/jira/browse/FLINK-31079
             Project: Flink
          Issue Type: Sub-task
          Components: Runtime / Coordination
            Reporter: Lijie Wang
             Fix For: 1.17.0


This task aims to verify FLINK-29663 which improves the adaptive batch 
scheduler.

Before the change of FLINK-29663, adaptive batch scheduler will distribute 
subpartitoins according to the number of subpartitions, make different 
downstream subtasks consume roughly the same number of subpartitions. This will 
lead to imbalance loads of different downstream tasks when the subpartitions 
contain different amounts of data.

To solve this problem, in FLINK-29663, we let the adaptive batch scheduler 
distribute subpartitoins according to the amount of data, so that different 
downstream subtasks consume roughly the same amount of data. Note that 
currently it only takes effect for All-To-All edges.

The documentation of adaptive scheduler can be found 
[here|https://nightlies.apache.org/flink/flink-docs-master/docs/deployment/elastic_scaling/#adaptive-batch-scheduler]

One can verify it by creating intended data skew on All-To-All edges.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to