[ 
https://issues.apache.org/jira/browse/FLINK-5572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15853453#comment-15853453
 ] 

Syinchwun Leo commented on FLINK-5572:
--------------------------------------

I tried to find a common way to abstract the States in WindowOperator, but it 
is very hard to balance memory use against performance. The current framework 
expresses most States well, with the exception of ListState. ListState is 
commonly used in windows with a UDF window function. ListState keeps the raw 
data, whereas other States may keep only intermediate data, so it is hard to 
unify all States under one common way to keep, fire and clean up windows. 
Could we keep the current framework and only optimize each specific State?

> ListState in SlidingEventTimeWindow and SlidingProcessingTimeWindow 
> optimization
> --------------------------------------------------------------------------------
>
>                 Key: FLINK-5572
>                 URL: https://issues.apache.org/jira/browse/FLINK-5572
>             Project: Flink
>          Issue Type: Improvement
>          Components: DataStream API
>    Affects Versions: 1.2.1
>         Environment: CentOS 7.2
>            Reporter: Syinchwun Leo
>
> When using ListState in SlidingEventTimeWindow and 
> SlidingProcessingTimeWindow, an element may be assigned to multiple 
> overlapping windows, which can consume a lot of storage. For example, with 
> window(SlidingEventTimeWindows.of(Time.seconds(10), 
> Time.seconds(2))).apply(UDF window function), each element is assigned to 5 
> windows. When size/slide is very large, this overhead becomes unacceptable.
> We plan to make a small optimization, and the doc is at 
> https://docs.google.com/document/d/1HCt1Si3YNGFwsl2H5SO0f7WD69DdBBPVJA6abd3oFWo/edit?usp=sharing
>  
> Comments?
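
To make the overlap in the quoted description concrete, here is a minimal 
sketch of the window-assignment arithmetic for a sliding window. It is plain 
Java with no Flink dependency; the class SlidingWindowOverlap and the method 
windowStarts are hypothetical names used only for illustration, and the sketch 
assumes non-negative timestamps in milliseconds and a window offset of 0. It 
is not the WindowOperator implementation.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; not part of Flink.
final class SlidingWindowOverlap {

    // Returns the start timestamps (ms) of every sliding window that
    // contains `timestamp`, assuming timestamp >= 0 and offset 0.
    static List<Long> windowStarts(long timestamp, long size, long slide) {
        List<Long> starts = new ArrayList<>();
        // Latest window start that is <= timestamp.
        long lastStart = timestamp - (timestamp % slide);
        // Walk backwards one slide at a time while the window
        // [start, start + size) still covers the timestamp.
        for (long start = lastStart; start > timestamp - size; start -= slide) {
            starts.add(start);
        }
        return starts;
    }

    public static void main(String[] args) {
        long size = 10_000L;  // Time.seconds(10)
        long slide = 2_000L;  // Time.seconds(2)
        List<Long> starts = windowStarts(13_500L, size, slide);
        // prints "5 windows: [12000, 10000, 8000, 6000, 4000]"
        System.out.println(starts.size() + " windows: " + starts);
    }
}
```

If each window keeps its own list of raw elements, every element is stored 
once per entry in that list, so the number of copies grows with size/slide: 
a one-hour window sliding every second would hold 3600 copies of each element.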



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
