[ https://issues.apache.org/jira/browse/FLINK-4142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15379092#comment-15379092 ]
ASF GitHub Bot commented on FLINK-4142: --------------------------------------- GitHub user rmetzger opened a pull request: https://github.com/apache/flink/pull/2255 [FLINK-4142][docs] Add warning about YARN HA bug There is a bug in YARN 2.4.0 preventing container starts from a re-started application master. This is how the updated documentation looks like:  You can merge this pull request into a Git repository by running: $ git pull https://github.com/rmetzger/flink flink4142 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/flink/pull/2255.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #2255 ---- commit f7a0d18135c42cc428d1c7951ae69796bdee0fe2 Author: Robert Metzger <rmetz...@apache.org> Date: 2016-07-15T09:20:17Z [FLINK-4142][docs] Add warning about YARN HA bug ---- > Recovery problem in HA on Hadoop Yarn 2.4.1 > ------------------------------------------- > > Key: FLINK-4142 > URL: https://issues.apache.org/jira/browse/FLINK-4142 > Project: Flink > Issue Type: Bug > Components: YARN Client > Affects Versions: 1.0.3 > Reporter: Stefan Richter > Assignee: Robert Metzger > > On Hadoop Yarn 2.4.1, recovery in HA fails in the following scenario: > 1) Kill application master, let it recover normally. > 2) After that, kill a task manager. > Now, Yarn tries to restart the killed task manager in an endless loop. -- This message was sent by Atlassian JIRA (v6.3.4#6332)