[ https://issues.apache.org/jira/browse/FLINK-1484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14327635#comment-14327635 ]

ASF GitHub Bot commented on FLINK-1484:
---------------------------------------

GitHub user tillrohrmann opened a pull request:

    https://github.com/apache/flink/pull/423

    [FLINK-1484] [FLINK-1499] Adds explicit disconnect messages in case of an actor shutdown

    Introduces explicit disconnect messages for graceful actor termination:
    the JobManager sends them to its TaskManagers, and a TaskManager sends
    one to its JobManager. These disconnect messages allow faster recovery
    from failures, so that a clean state is reached quickly.
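
As a rough illustration of the idea described above (not the code in this pull request), the following minimal Java sketch models both directions of the explicit disconnect. All names (`DisconnectSketch`, `Master`, `Worker`, `onMasterDisconnect`, `onWorkerDisconnect`) are hypothetical, and plain method calls stand in for the actor messages. The assumption is that the receiving side cleans up its own state as soon as the disconnect arrives instead of waiting to detect the failure.

```java
import java.util.HashSet;
import java.util.Set;

// Illustrative only: hypothetical names, plain method calls instead of actor messages.
final class DisconnectSketch {

    /** Stand-in for a JobManager that tracks its registered workers. */
    static final class Master {
        private final Set<Worker> workers = new HashSet<>();

        void register(Worker worker) {
            workers.add(worker);
        }

        /** Explicit disconnect received from a worker that is shutting down. */
        void onWorkerDisconnect(Worker worker) {
            workers.remove(worker);
        }

        /** Graceful termination: notify every registered worker before going away. */
        void shutdown() {
            for (Worker worker : workers) {
                worker.onMasterDisconnect(this);
            }
            workers.clear();
        }

        int registeredWorkers() {
            return workers.size();
        }
    }

    /** Stand-in for a TaskManager that remembers which master it is registered at. */
    static final class Worker {
        private Master master; // null while disconnected
        private final Set<String> runningTasks = new HashSet<>();

        void connectTo(Master newMaster) {
            master = newMaster;
            newMaster.register(this);
        }

        void startTask(String taskId) {
            runningTasks.add(taskId);
        }

        /** Explicit disconnect received from the master: drop local state right away. */
        void onMasterDisconnect(Master sender) {
            if (sender == master) { // ignore messages from a master we are not registered at
                runningTasks.clear();
                master = null;
            }
        }

        /** Graceful termination: tell the master so it can unregister this worker. */
        void shutdown() {
            if (master != null) {
                master.onWorkerDisconnect(this);
                master = null;
            }
            runningTasks.clear();
        }

        boolean isConnected() {
            return master != null;
        }

        int runningTaskCount() {
            return runningTasks.size();
        }
    }

    public static void main(String[] args) {
        Master master = new Master();
        Worker worker = new Worker();
        worker.connectTo(master);
        worker.startTask("task-1");

        master.shutdown(); // the worker is notified and resets itself
        System.out.println("connected=" + worker.isConnected()
                + ", tasks=" + worker.runningTaskCount());
    }
}
```

After a disconnect, a worker in this sketch is back in a clean state and could call `connectTo` again to register at a restarted master.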
    
    Contains minor Scala cleanups.
    
    This PR is based on #419 

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/tillrohrmann/flink taskManagerDisconnect

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/flink/pull/423.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #423
    
----
commit 8cc604d61d75370972146333c5a016b5fcdddc77
Author: Till Rohrmann <trohrm...@apache.org>
Date:   2015-02-19T10:04:56Z

    [FLINK-1584] [runtime][tests] Fixes TaskManagerFailsITCase by replacing the TestingCluster with a ForkableFlinkMiniCluster

commit 21660683633df999b86a7240929e07b8935e17df
Author: Till Rohrmann <trohrm...@apache.org>
Date:   2015-02-17T14:54:42Z

    [Flink-1484] [runtime] Adds explicit disconnect message for TaskManagers

commit b2ff739feb6915bb131d1aeac7ca772eb4f85cba
Author: Till Rohrmann <trohrm...@apache.org>
Date:   2015-02-17T15:36:34Z

    [FLINK-1499] [runtime] TaskManager sends explicit disconnect message to JobManager in case of shutdown

----


> JobManager restart does not notify the TaskManager
> --------------------------------------------------
>
>                 Key: FLINK-1484
>                 URL: https://issues.apache.org/jira/browse/FLINK-1484
>             Project: Flink
>          Issue Type: Bug
>            Reporter: Till Rohrmann
>
> The JobManager can be restarted, for example due to an uncaught
> exception. However, connected TaskManagers are not informed about the
> disconnection and continue sending messages to a JobManager whose state
> has been reset.
> TaskManagers should be informed about a possible restart and clean up their
> own state in such a case. Afterwards, they can try to reconnect to the
> restarted JobManager.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
