fanxin created FLINK-17871: ------------------------------ Summary: Make the default value of attemptFailuresValidityInterval more reasonable Key: FLINK-17871 URL: https://issues.apache.org/jira/browse/FLINK-17871 Project: Flink Issue Type: Improvement Components: Deployment / YARN Reporter: fanxin
Default value of `yarn.application-attempt-failures-validity-interval` is `10000` milliseconds at present. Usually preparing the context alone can take seconds, which means that default value 10000 is too small even to ready a runtime context. With a default config, a flink on yarn job in will hardly meet the condition of ”fail 2 times in 10s“. If the job has some internal problems, unfortunately, it can easily get bogged down in endless retries. -- This message was sent by Atlassian Jira (v8.3.4#803005)