Zhanghao Chen created FLINK-33940: ------------------------------------- Summary: Update Update the auto-derivation rule of max parallelism for enlarged upscaling space Key: FLINK-33940 URL: https://issues.apache.org/jira/browse/FLINK-33940 Project: Flink Issue Type: Improvement Components: API / Core Reporter: Zhanghao Chen
*Background* The choice of the max parallelism of an stateful operator is important as it limits the upper bound of the parallelism of the opeartor while it can also add extra overhead when being set too large. Currently, the max parallelism of an opeartor is either fixed to a value specified by API core / pipeline option or auto-derived with the following rules: `min(max(roundUpToPowerOfTwo(operatorParallelism * 1.5), 128), 32767)` *Problem* Recently, the elasticity of Flink jobs is becoming more and more valued by users. The current auto-derived max parallelism was introduced a time time ago and only allows the operator parallelism to be roughly doubled, which is not desired for elasticity. Setting an max parallelism manually may not be desired as well: users may not have the sufficient expertise to select a good max-parallelism value. *Proposal* Update the auto-derivation rule of max parallelism to derive larger max parallelism for better elasticity experience out of the box. A candidate is as follows: `min(max(roundUpToPowerOfTwo(operatorParallelism * {*}5{*}), {*}1024{*}), 32767)` Looking forward to your opinions on this. -- This message was sent by Atlassian Jira (v8.20.10#820010)