Till Rohrmann created FLINK-9845:
------------------------------------

             Summary: Make InternalTimerService's timer processing 
interruptible/abortable
                 Key: FLINK-9845
                 URL: https://issues.apache.org/jira/browse/FLINK-9845
             Project: Flink
          Issue Type: Improvement
          Components: State Backends, Checkpointing
    Affects Versions: 1.5.1, 1.6.0
            Reporter: Till Rohrmann
             Fix For: 1.7.0


When cancelling a {{Task}}, the task thread might currently process the timers 
registered at the {{InternalTimerService}}. Depending on the timer action, this 
might take a while and, thus, blocks the cancellation of the {{Task}}. In the 
most extreme case, the {{TaskCancelerWatchDog}} kicks in and kills the whole 
{{TaskManager}} process.

In order to alleviate the problem (speed up the cancellation reaction), we 
should make the processing of the timers interruptible/abortable. This means 
that instead of processing all timers we should check in between timers whether 
the {{Task}} is currently being cancelled or not. If this is the case, then we 
should directly stop processing the remaining timers and return.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to