On 2019/03/25 08:30:00, "ryantaocer (JIRA)" <j...@apache.org> wrote: 
> ryantaocer created FLINK-12002:
> ----------------------------------
> 
>              Summary: Adaptive Parallelism of Job Vertex Execution
>                  Key: FLINK-12002
>                  URL: https://issues.apache.org/jira/browse/FLINK-12002
>              Project: Flink
>           Issue Type: Improvement
>             Reporter: ryantaocer
>             Assignee: BoWang
> 
> 
> In Flink the parallelism of job is a pre-specified parameter, which is 
> usually an empirical value and thus might not be optimal for both performance 
> and resource depending on the amount of data processed in each task.
> 
> Furthermore, a fixed parallelism cannot scale to varying data size common in 
> production cluster where we may not often change configurations. 
> 
> We propose to determine the job parallelism adaptive to the actual total 
> input data size and an ideal data size processed by each task. The ideal size 
> is pre-specified according to the properties of the operator such as the 
> preparation overhead compared with data processing time.
> 
> detailed design doc coming soon.
> 
> 
> 
> --
> This message was sent by Atlassian JIRA
> (v7.6.3#76005)
> Sounds great for maximizing resource usage! Expecting design doc in more 
> details.

Reply via email to