[ 
https://issues.apache.org/jira/browse/FLINK-34906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rui Fan updated FLINK-34906:
----------------------------
    Description: 
Currently, the autoscaler will scale a job when the JobStatus is RUNNING. But 
the JobStatus will be RUNNING once job starts schedule, so it doesn't mean all 
tasks are running. Especially, when the resource isn't enough or job recovers 
from large state.

The autoscaler will throw exception and generate the AutoscalerError event when 
tasks are not ready, such as: 

 !image-2024-03-21-17-40-23-523.png! 


Also, we don't need to scale it when some tasks are not ready.

Solution: we only scale job that all tasks are running(some of tasks may be 
finished). 

  was:
Currently, the autoscaler will scale a job when the JobStatus is RUNNING. But 
the JobStatus will be RUNNING once job starts schedule, so it doesn't mean all 
tasks are running. Especially, when the resource isn't enough or job recovers 
from large state.

The autoscaler will throw exception and generate the AutoscalerError event when 
tasks are not ready, such as: 

 !image-2024-03-21-17-40-23-523.png! 


Solution: we only scale job that all tasks are running(some of tasks may be 
finished). 


> Don't start autoscaling when some tasks are not running
> -------------------------------------------------------
>
>                 Key: FLINK-34906
>                 URL: https://issues.apache.org/jira/browse/FLINK-34906
>             Project: Flink
>          Issue Type: Improvement
>          Components: Autoscaler
>            Reporter: Rui Fan
>            Assignee: Rui Fan
>            Priority: Major
>             Fix For: 1.9.0
>
>         Attachments: image-2024-03-21-17-40-23-523.png
>
>
> Currently, the autoscaler will scale a job when the JobStatus is RUNNING. But 
> the JobStatus will be RUNNING once job starts schedule, so it doesn't mean 
> all tasks are running. Especially, when the resource isn't enough or job 
> recovers from large state.
> The autoscaler will throw exception and generate the AutoscalerError event 
> when tasks are not ready, such as: 
>  !image-2024-03-21-17-40-23-523.png! 
> Also, we don't need to scale it when some tasks are not ready.
> Solution: we only scale job that all tasks are running(some of tasks may be 
> finished). 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to