Re: AskTimeoutException when canceling job with savepoint on flink 1.6.0

2018-09-05 Thread Gary Yao
Hi Jelmer, I saw that you have already found the JIRA issue tracking this problem [1] but I will still answer on the mailing list for transparency. The timeout for "cancel with savepoint" should be RpcUtils.INF_TIMEOUT. Unfortunately Flink is currently not respecting this timeout. A pull request

Re: AskTimeoutException when canceling job with savepoint on flink 1.6.0

2018-09-05 Thread vino yang
Hi Jelmer, Here's a similar question, and you can refer to the discussion options.[1] [1]: http://mail-archives.apache.org/mod_mbox/flink-user/201808.mbox/%3ccamjeyba9zjx_huqtlxdcu87hphrvrzxzoyjpqxzxdkq2h_k...@mail.gmail.com%3E Hi Till and Chesnay, Recently, several users have encountered this

AskTimeoutException when canceling job with savepoint on flink 1.6.0

2018-09-05 Thread jelmer
I am trying to upgrade a job from flink 1.4.2 to 1.6.0 When we do a deploy we cancel the job with a savepoint then deploy the new version of the job from that savepoint. Because our jobs tend to have a lot of state it often takes multiple minutes for our savepoints to complete. On flink 1.4.2 we