Stephan Ewen created FLINK-17672:
------------------------------------

             Summary:  OperatorCoordinators receive failure notifications on 
task failurenstead of restarts
                 Key: FLINK-17672
                 URL: https://issues.apache.org/jira/browse/FLINK-17672
             Project: Flink
          Issue Type: Sub-task
          Components: Runtime / Coordination
            Reporter: Stephan Ewen
            Assignee: Stephan Ewen
             Fix For: 1.11.0


 Currently, the OperatorCoordinators receive failure notifications on task 
restart. That follows the same approach as the InputSplit assigners from the 
legacy sources (after which the integration of the Coordinators with the 
Scheduler was modeled).

However, propagating the failure notifications during the actual failure is 
more intuitive, and also improve situations where tasks fail but don't get 
restarted for a while (this can happen for batch tasks when a TM dies and no 
spare resources are available). In those cases, the coordinator can react much 
earlier to the failure.




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to