Re: Flink 1.10.0 failover

2020-04-26 Thread Zhu Zhu
ough resource if all old taskmanagers were released, sometimes the job can create one, sometimes none. This never happened on 1.8.2; I use the same cluster and job, just a different Flink version. The job may fail and auto-recover, but in 1.10.0, it seems YARN misses some taskmanager fai

Re: Flink 1.10.0 failover

2020-04-25 Thread Zhu Zhu
Sorry, I did not quite understand the problem. Do you mean a failed job does not release resources to YARN? - if so, is the job in the process of restarting? A job in recovery will reuse its slots, so they will not be released. Or do you mean a failed job cannot acquire slots when it is restarted in auto-recovery? - if

Flink 1.10.0 failover

2020-04-25 Thread seeksst
Hi, Recently I found a problem: when a job failed in 1.10.0, Flink didn’t release resources first. You can see I used Flink on YARN, and it doesn’t allocate a task manager, because no more memory is left. If I cancel the job, the cluster has more memory. In 1.8.2, the job will restart normally