Aljoscha Krettek created FLINK-2317:
---------------------------------------

             Summary: Stream Jobs with Windows Leave Lingering State
                 Key: FLINK-2317
                 URL: https://issues.apache.org/jira/browse/FLINK-2317
             Project: Flink
          Issue Type: Bug
          Components: Streaming
    Affects Versions: 0.9, 0.9.1
            Reporter: Aljoscha Krettek
            Priority: Blocker


The problem is that some components in the windowing system create Threads that 
are not properly cleaned up after a Job finished (is canceled). Specifically 
these are the {{CentralCheck}} thread of the {{GroupedActiveDiscretizer}} and 
the {{TimeCheck}} thread in {{TimeTriggerPolicy}}. These threads have 
references to other parts of the windowing system, such as the window buffers. 
This then means that the complete state of the buffers at the time a job is 
canceled stays around indefinitely. Over time this will fill up the memory and 
make successive windowing jobs incredibly slow or lead to GC overheat 
exceptions.

Also, to prevent stuff like this from happening in the future we should put all 
streaming Tests on a common {{TestBase}} that verifies that we have no 
lingering threads after job execution.  



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to