Till Rohrmann created FLINK-5197:
------------------------------------

             Summary: Late JobStatusChanges can interfere with running jobs
                 Key: FLINK-5197
                 URL: https://issues.apache.org/jira/browse/FLINK-5197
             Project: Flink
          Issue Type: Bug
          Components: JobManager
    Affects Versions: 1.1.3, 1.2.0
            Reporter: Till Rohrmann
            Assignee: Till Rohrmann
            Priority: Minor
             Fix For: 1.2.0, 1.1.4


When the {{JobManager}} receives a {{JobStatusChanged}} message, it will look 
up the {{ExecutionGraph}} for the given {{JobID}}. If there is no 
{{ExecutionGraph}}, then a {{RemoveJob}} message is sent to itself. In the 
general case, this is not problematic, because the {{RemoveJob}} message won't 
do anything if there is no {{ExecutionGraph}}. However, since this is an 
asynchronous call, it can happen that the corresponding job of the {{JobID}} is 
recovered before receiving the {{RemoveJob}} message. In this case, the newly 
recovered job would be removed.

I propose to change the behaviour such that a {{JobStatusChanged}} for a 
non-existing {{ExecutionGraph}} will be simply ignored.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to