[jira] [Commented] (FLINK-4142) Recovery problem in HA on Hadoop Yarn 2.4.1

ASF GitHub Bot (JIRA) Fri, 15 Jul 2016 02:23:00 -0700

    [ 
https://issues.apache.org/jira/browse/FLINK-4142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15379092#comment-15379092
 ]


ASF GitHub Bot commented on FLINK-4142:
---------------------------------------

GitHub user rmetzger opened a pull request:

    https://github.com/apache/flink/pull/2255

    [FLINK-4142][docs] Add warning about YARN HA bug

    There is a bug in YARN 2.4.0 preventing container starts from a re-started 
application master. 
    
    This is how the updated documentation looks like:
    
    
![image](https://cloud.githubusercontent.com/assets/89049/16869650/284d593a-4a7e-11e6-809d-157a5c49a5a3.png)


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/rmetzger/flink flink4142

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/flink/pull/2255.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #2255
    
----
commit f7a0d18135c42cc428d1c7951ae69796bdee0fe2
Author: Robert Metzger <rmetz...@apache.org>
Date:   2016-07-15T09:20:17Z

    [FLINK-4142][docs] Add warning about YARN HA bug

----


> Recovery problem in HA on Hadoop Yarn 2.4.1
> -------------------------------------------
>
>                 Key: FLINK-4142
>                 URL: https://issues.apache.org/jira/browse/FLINK-4142
>             Project: Flink
>          Issue Type: Bug
>          Components: YARN Client
>    Affects Versions: 1.0.3
>            Reporter: Stefan Richter
>            Assignee: Robert Metzger
>
> On Hadoop Yarn 2.4.1, recovery in HA fails in the following scenario:
> 1) Kill application master, let it recover normally.
> 2) After that, kill a task manager.
> Now, Yarn tries to restart the killed task manager in an endless loop. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (FLINK-4142) Recovery problem in HA on Hadoop Yarn 2.4.1

Reply via email to