Re: Please suggest helpful tools

2020-01-13 Thread Kurt Young
g to see >>>>> why checkpoint failed, might be declined by some specific task. >>>>> >>>>> If checkpoint expired, you can also access the web UI to see which >>>>> tasks did not respond in time, some hot task might not be able to respond >>>>> in time. Gen

Re: Please suggest helpful tools

2020-01-13 Thread Eva Eva
nt barrier did not arrive in time. Resolve >>>> the back pressure could help the checkpoint finished before timeout. >>>> >>>> I think the doc of monitoring web UI for checkpoint [1] and back >>>> pressure [2] could help you. >>>> >>&g

Re: Please suggest helpful tools

2020-01-12 Thread Kurt Young
;> I think the doc of monitoring web UI for checkpoint [1] and back >>> pressure [2] could help you. >>> >>> [1] >>> https://ci.apache.org/projects/flink/flink-docs-release-1.9/monitoring/checkpoint_monitoring.html >>> [2] >>> https://ci.apache.org/projects/flink/flink-docs-r

Re: Please suggest helpful tools

2020-01-10 Thread Eva Eva
;> [1] >> https://ci.apache.org/projects/flink/flink-docs-release-1.9/monitoring/checkpoint_monitoring.html >> [2] >> https://ci.apache.org/projects/flink/flink-docs-release-1.9/monitoring/back_pressure.html >> >> Best >> Yun Tang >> --

Re: Please suggest helpful tools

2020-01-10 Thread Congxian Qiu
. > > [1] > https://ci.apache.org/projects/flink/flink-docs-release-1.9/monitoring/checkpoint_monitoring.html > [2] > https://ci.apache.org/projects/flink/flink-docs-release-1.9/monitoring/back_pressure.html > > Best > Yun Tang > -- > *From:* Ev

Re: Please suggest helpful tools

2020-01-10 Thread Yun Tang
:29 To: user Subject: Please suggest helpful tools Hi, I'm running Flink job on 1.9 version with blink planner. My checkpoints are timing out intermittently, but as state grows they are timing out more and more often eventually killing the job. Size of the state is large with Minimum=10.2M

Please suggest helpful tools

2020-01-09 Thread Eva Eva
Hi, I'm running Flink job on 1.9 version with blink planner. My checkpoints are timing out intermittently, but as state grows they are timing out more and more often eventually killing the job. Size of the state is large with Minimum=10.2MB and Maximum=49GB (this one is accumulated due to prior