Hi, this, in fact, seems to be a bug. There should be something like windowState.clear(); for (IN element: projectedContents) { windowState.add(element); }
after passing the elements to the window function. This is very inefficient but the only way I see of doing it right now. Cheers, Aljoscha On Thu, 21 Jul 2016 at 01:32 Vishnu Viswanath <vishnu.viswanat...@gmail.com> wrote: > Hi, > > When we use RocksDB as state backend, how does the backend state get > updated after some elements are evicted from the window? > I don't see any update call being made to remove the element from the state > stored in RocksDB. > > It looks like the RocksDBListState is only having get() and add() methods > since it is an AppendingState, but that causes the evicted elements to come > back when the trigger is fired next time. (It works fine when I use > MemoryStateBackend) > > Is this expected behavior or am I missing something. > > Thanks, > Vishnu > > On Mon, Jul 18, 2016 at 7:15 AM, Vishnu Viswanath < > vishnu.viswanat...@gmail.com> wrote: > > > Hi Aljoscha, > > > > Thanks! Yes, I have the create page option now in wiki. > > > > Regards, > > Vishnu Viswanath, > > > > On Mon, Jul 18, 2016 at 6:34 AM, Aljoscha Krettek <aljos...@apache.org> > > wrote: > > > >> @Radu, addition of more window types and sorting should be part of > another > >> design proposal. This is interesting stuff but I think we should keep > >> issues separated because things can get complicated very quickly. > >> > >> On Mon, 18 Jul 2016 at 12:32 Aljoscha Krettek <aljos...@apache.org> > >> wrote: > >> > >> > Hi, > >> > about TimeEvictor, yes, I think there should be specific evictors for > >> > processing time and event time. Also, the current time should be > >> > retrievable from the EvictorContext. > >> > > >> > For the wiki you will need permissions. This was recently changed > >> because > >> > there was too much spam. I gave you permission to add pages. Can you > >> please > >> > try and check if it works? > >> > > >> > Cheers, > >> > Aljoscha > >> > > >> > On Fri, 15 Jul 2016 at 13:28 Vishnu Viswanath < > >> > vishnu.viswanat...@gmail.com> wrote: > >> > > >> >> Hi all, > >> >> > >> >> How do we create a FLIP page, is there any permission setup > required? I > >> >> don't see any "Create" page(after logging in) option in the header as > >> >> mentioned in > >> >> > >> >> > >> > https://cwiki.apache.org/confluence/display/FLINK/Flink+Improvement+Proposals > >> >> > >> >> > >> >> Thanks, > >> >> Vishnu > >> >> > >> >> On Wed, Jul 13, 2016 at 10:22 PM, Vishnu Viswanath < > >> >> vishnu.viswanat...@gmail.com> wrote: > >> >> > >> >> > Hi Aljoscha, > >> >> > > >> >> > I agree, the user will know exactly that they are creating an > >> EventTime > >> >> > based evictor or ProcessingTime based evictor looking at the code. > >> >> > So do you think it will be ok to have multiple versions of > >> TimeEvictor > >> >> > (one for event time and one for processing time) and also a > >> DeltaEvcitor > >> >> > (again 2 versions- for event time and processing time) ? > >> >> > > >> >> > Please note that the existing behavior of TimeEvictor/DeltaEvictor > >> does > >> >> > not consider if it is EventTime or ProcessingTime > >> >> > e.g., in TimeEvictor the current time is considered as the > timestamp > >> of > >> >> > the last element in the window > >> >> > > >> >> > *long currentTime = Iterables.getLast(elements).getTimestamp();* > >> >> > > >> >> > not the highest timestamp of all elements > >> >> > what I am trying to achieve is something like: > >> >> > > >> >> > *long currentTime;* > >> >> > * if (ctx.isEventTime()) {* > >> >> > * currentTime = getMaxTimestamp(elements);* > >> >> > * } else {* > >> >> > * currentTime = Iterables.getLast(elements).getTimestamp();* > >> >> > * }* > >> >> > > >> >> > Similarly, in DeltaEvictor the *`lastElement`* is > >> >> > *`Iterables.getLast(elements);`* and I am thinking we should > consider > >> >> the > >> >> > element with max timestamp as the last element instead of just > >> getting > >> >> the > >> >> > last inserted element as *`lastElement`* > >> >> > > >> >> > Do you think it is the right thing to do or leave the behavior > >> Evictors > >> >> as > >> >> > is, w.r.t to choosing the last element? > >> >> > > >> >> > Thanks, > >> >> > Vishnu > >> >> > > >> >> > On Wed, Jul 13, 2016 at 11:07 AM, Aljoscha Krettek < > >> aljos...@apache.org > >> >> > > >> >> > wrote: > >> >> > > >> >> >> I still think it should be explicit in the class. For example, if > >> you > >> >> have > >> >> >> this code: > >> >> >> > >> >> >> input > >> >> >> .keyBy() > >> >> >> .window() > >> >> >> .trigger(EventTimeTrigger.create()) > >> >> >> .evictor(TimeTrigger.create()) > >> >> >> > >> >> >> the time behavior of the trigger is explicitly specified while the > >> >> evictor > >> >> >> would dynamically adapt based on internal workings that the user > >> might > >> >> not > >> >> >> be aware of. Having the behavior explicit at the call site is very > >> >> >> important, in my opinion. > >> >> >> > >> >> >> On Wed, 13 Jul 2016 at 16:28 Vishnu Viswanath < > >> >> >> vishnu.viswanat...@gmail.com> > >> >> >> wrote: > >> >> >> > >> >> >> > Hi, > >> >> >> > > >> >> >> > I was hoping to use the isEventTime method in the WindowAssigner > >> to > >> >> set > >> >> >> > that information in the EvictorContext. > >> >> >> > What do you think?. > >> >> >> > > >> >> >> > Thanks and Regards, > >> >> >> > Vishnu Viswanath, > >> >> >> > > >> >> >> > On Wed, Jul 13, 2016 at 10:09 AM, Aljoscha Krettek < > >> >> aljos...@apache.org > >> >> >> > > >> >> >> > wrote: > >> >> >> > > >> >> >> > > Hi, > >> >> >> > > I think the way to go here is to add both an EventTimeEvictor > >> and a > >> >> >> > > ProcessingTimeEvictor. The problem is that "isEventTime" > cannot > >> >> >> really be > >> >> >> > > determined. That's also the reason why there is an > >> EventTimeTrigger > >> >> >> and a > >> >> >> > > ProcessingTimeTrigger. It was just an oversight that the > >> >> TimeEvictor > >> >> >> does > >> >> >> > > not also have these two versions. > >> >> >> > > > >> >> >> > > About EvictingWindowOperator, I think you can make the two > >> methods > >> >> >> > > non-final in WindowOperator, yes. > >> >> >> > > > >> >> >> > > Cheers, > >> >> >> > > Aljoscha > >> >> >> > > > >> >> >> > > On Wed, 13 Jul 2016 at 14:32 Vishnu Viswanath < > >> >> >> > > vishnu.viswanat...@gmail.com> > >> >> >> > > wrote: > >> >> >> > > > >> >> >> > > > Hi Aljoscha, > >> >> >> > > > > >> >> >> > > > I am thinking of adding a method boolean isEventTime(); in > the > >> >> >> > > > EvictorContext apart from > >> >> >> > > > > >> >> >> > > > long getCurrentProcessingTime(); > >> >> >> > > > MetricGroup getMetricGroup(); > >> >> >> > > > long getCurrentWatermark(); > >> >> >> > > > > >> >> >> > > > This method can be used to make the Evictor not iterate > >> through > >> >> all > >> >> >> the > >> >> >> > > > elements in TimeEvictor. There will be a few changes in the > >> >> existing > >> >> >> > > > behavior of TimeEvictor and DeltaEvictor (I have mentioned > >> this > >> >> in > >> >> >> the > >> >> >> > > > design doc) > >> >> >> > > > > >> >> >> > > > Also, is there any specific reason why the open and close > >> method > >> >> in > >> >> >> > > > WindowEvictor is made final? Since the EvictorContext will > be > >> in > >> >> the > >> >> >> > > > EvictingWindowOperator, I need to override the open and > close > >> in > >> >> >> > > > EvitingWindowOperator to make the reference of > EvictorContext > >> >> null. > >> >> >> > > > > >> >> >> > > > Thanks and Regards, > >> >> >> > > > Vishnu Viswanath, > >> >> >> > > > > >> >> >> > > > On Fri, Jul 8, 2016 at 7:40 PM, Vishnu Viswanath < > >> >> >> > > > vishnu.viswanat...@gmail.com> wrote: > >> >> >> > > > > >> >> >> > > > My thought process when asking if we can use state backend > in > >> >> window > >> >> >> > > > > function was : can we add the elements to be evicted into > >> some > >> >> >> state > >> >> >> > > and > >> >> >> > > > > allow the evictAfter to read it from some context and > >> remove it > >> >> >> from > >> >> >> > > the > >> >> >> > > > > window? > >> >> >> > > > > > >> >> >> > > > > > >> >> >> > > > > On Fri, Jul 8, 2016 at 7:30 PM, Vishnu Viswanath < > >> >> >> > > > > vishnu.viswanat...@gmail.com> wrote: > >> >> >> > > > > > >> >> >> > > > >> Hi Aljoscha, > >> >> >> > > > >> > >> >> >> > > > >> Thanks for the explanation, and sorry for late reply was > >> busy > >> >> >> with > >> >> >> > > work. > >> >> >> > > > >> > >> >> >> > > > >> I did think about this scenario, in fact in my previous > >> mail I > >> >> >> > thought > >> >> >> > > > of > >> >> >> > > > >> posting this question, then I understood that this > problem > >> >> will > >> >> >> be > >> >> >> > > > >> there which ever method we choose(Trigger looking for > >> pattern > >> >> or > >> >> >> > > Window > >> >> >> > > > >> looking for pattern). > >> >> >> > > > >> > >> >> >> > > > >> I do have a pretty good watermark but my concern is that > it > >> >> >> changes > >> >> >> > > > based > >> >> >> > > > >> on the key of these messages(I don't know if it is > >> possible, > >> >> >> haven't > >> >> >> > > > >> started coding that yet. May be you could tell me). Even > if > >> >> it is > >> >> >> > yes > >> >> >> > > > some > >> >> >> > > > >> of these watermarks will be long(in days), which I don't > >> want > >> >> the > >> >> >> > > > trigger > >> >> >> > > > >> to wait that long. > >> >> >> > > > >> > >> >> >> > > > >> It looks like it is not easy to have an evictAfter based > on > >> >> >> window > >> >> >> > > > >> function(without introducing coupling), but can the > current > >> >> >> window > >> >> >> > > apply > >> >> >> > > > >> function be modified to allow it to change the elements > in > >> it > >> >> - > >> >> >> may > >> >> >> > be > >> >> >> > > > >> using some state backend(I don't know how excatly the > >> >> internals > >> >> >> of > >> >> >> > > these > >> >> >> > > > >> work, so this might be a wrong question) > >> >> >> > > > >> > >> >> >> > > > >> Thanks and Regards, > >> >> >> > > > >> Vishnu Viswanath, > >> >> >> > > > >> > >> >> >> > > > >> On Fri, Jul 8, 2016 at 10:20 AM, Aljoscha Krettek < > >> >> >> > > aljos...@apache.org> > >> >> >> > > > >> wrote: > >> >> >> > > > >> > >> >> >> > > > >>> Hi Vishnu, > >> >> >> > > > >>> how long would these patterns be? The Trigger would not > >> have > >> >> to > >> >> >> > sort > >> >> >> > > > the > >> >> >> > > > >>> elements for every new element but just insert the new > >> >> element > >> >> >> into > >> >> >> > > an > >> >> >> > > > >>> internal data structure. Only when it sees that the > >> >> watermark is > >> >> >> > > past a > >> >> >> > > > >>> certain point would it check whether the pattern matches > >> and > >> >> >> > actually > >> >> >> > > > >>> Trigger. > >> >> >> > > > >>> > >> >> >> > > > >>> A general note regarding order and event time: I think > >> >> relying > >> >> >> on > >> >> >> > > this > >> >> >> > > > >>> for > >> >> >> > > > >>> computation is very tricky unless the watermark is 100 % > >> >> >> correct or > >> >> >> > > you > >> >> >> > > > >>> completely discard elements that arrive after the > >> watermark, > >> >> >> i.e. > >> >> >> > > > >>> elements > >> >> >> > > > >>> that would break the promise of the watermark that no > >> >> elements > >> >> >> with > >> >> >> > > an > >> >> >> > > > >>> earlier timestamp will ever arrive. The reason for this > is > >> >> that > >> >> >> > there > >> >> >> > > > >>> could > >> >> >> > > > >>> always enter new elements that end up between already > seen > >> >> >> > elements. > >> >> >> > > > For > >> >> >> > > > >>> example, let's say we have this sequence of elements > when > >> the > >> >> >> > trigger > >> >> >> > > > >>> fires: > >> >> >> > > > >>> > >> >> >> > > > >>> a-b-a > >> >> >> > > > >>> > >> >> >> > > > >>> This is the sequence that you are looking for and you > emit > >> >> some > >> >> >> > > result > >> >> >> > > > >>> from > >> >> >> > > > >>> the WindowFunction. Now, new elements arrive that fall > in > >> >> >> between > >> >> >> > the > >> >> >> > > > >>> elements we already have: > >> >> >> > > > >>> > >> >> >> > > > >>> a-d-e-b-f-g-a > >> >> >> > > > >>> > >> >> >> > > > >>> This is an updated, sorted view of the actual event-time > >> >> stream > >> >> >> and > >> >> >> > > we > >> >> >> > > > >>> didn't realize that the stream actually looks like this > >> >> before. > >> >> >> > Does > >> >> >> > > > this > >> >> >> > > > >>> still match the original pattern or should we now > consider > >> >> this > >> >> >> as > >> >> >> > > > >>> non-matching? If no, then the earlier successful match > for > >> >> a-b-a > >> >> >> > was > >> >> >> > > > >>> wrong > >> >> >> > > > >>> and we should never have processed it but we didn't know > >> at > >> >> the > >> >> >> > time. > >> >> >> > > > If > >> >> >> > > > >>> yes, then pattern matching like this can be done in the > >> >> Trigger > >> >> >> by > >> >> >> > > > having > >> >> >> > > > >>> something like pattern slots: You don't have to store > all > >> >> >> elements > >> >> >> > in > >> >> >> > > > the > >> >> >> > > > >>> Trigger, you just need to store possible candidates that > >> >> could > >> >> >> > match > >> >> >> > > > the > >> >> >> > > > >>> pattern and ignore the other (in-between) elements. > >> >> >> > > > >>> > >> >> >> > > > >>> Cheers, > >> >> >> > > > >>> Aljoscha > >> >> >> > > > >>> > >> >> >> > > > >>> On Fri, 8 Jul 2016 at 14:10 Vishnu Viswanath < > >> >> >> > > > >>> vishnu.viswanat...@gmail.com> > >> >> >> > > > >>> wrote: > >> >> >> > > > >>> > >> >> >> > > > >>> > Hi Aljoscha, > >> >> >> > > > >>> > > >> >> >> > > > >>> > That is a good idea, trying to tie it back to the use > >> case, > >> >> >> > > > >>> > e.g., suppose trigger is looking for a pattern, a-b-a > >> and > >> >> >> when it > >> >> >> > > > sees > >> >> >> > > > >>> such > >> >> >> > > > >>> > a pattern, it will trigger the window and it knows > that > >> now > >> >> >> the > >> >> >> > > > >>> Evictor is > >> >> >> > > > >>> > going to evict the element b, and trigger updates its > >> >> state as > >> >> >> > a-a > >> >> >> > > > >>> (even > >> >> >> > > > >>> > before the window & evictor completes) and will be > >> looking > >> >> for > >> >> >> > the > >> >> >> > > > >>> rest of > >> >> >> > > > >>> > the pattern i.e., b-a. But I can think of 1 problem > >> here, > >> >> >> > > > >>> > > >> >> >> > > > >>> > - the events can arrive out of order, i.e., the > >> trigger > >> >> >> might > >> >> >> > be > >> >> >> > > > >>> seeing > >> >> >> > > > >>> > a pattern a-a-b but actual event time is a-b-a then > >> >> trigger > >> >> >> > will > >> >> >> > > > >>> have to > >> >> >> > > > >>> > sort the elements in the window everytime it sees > an > >> >> >> element. > >> >> >> > (I > >> >> >> > > > was > >> >> >> > > > >>> > planning to do this sorting in the window, which > >> will be > >> >> >> less > >> >> >> > > > often > >> >> >> > > > >>> - > >> >> >> > > > >>> > only > >> >> >> > > > >>> > when the trigger fires) > >> >> >> > > > >>> > > >> >> >> > > > >>> > Thanks and Regards, > >> >> >> > > > >>> > Vishnu Viswanath, > >> >> >> > > > >>> > > >> >> >> > > > >>> > On Fri, Jul 8, 2016 at 6:04 AM, Aljoscha Krettek < > >> >> >> > > > aljos...@apache.org> > >> >> >> > > > >>> > wrote: > >> >> >> > > > >>> > > >> >> >> > > > >>> > Hi, > >> >> >> > > > >>> > > come to think of it, the right place to put such > >> checks > >> >> is > >> >> >> > > actually > >> >> >> > > > >>> the > >> >> >> > > > >>> > > Trigger. It would have to be a custom trigger that > >> >> observes > >> >> >> > time > >> >> >> > > > but > >> >> >> > > > >>> also > >> >> >> > > > >>> > > keeps some internal state machine to decide when it > >> has > >> >> >> > observed > >> >> >> > > > the > >> >> >> > > > >>> > right > >> >> >> > > > >>> > > pattern in the window. Then the window function > would > >> >> just > >> >> >> have > >> >> >> > > to > >> >> >> > > > >>> do the > >> >> >> > > > >>> > > processing and you have good separation of concerns. > >> Does > >> >> >> that > >> >> >> > > make > >> >> >> > > > >>> > sense? > >> >> >> > > > >>> > > > >> >> >> > > > >>> > > I'm ignoring time and sorting by time for now > because > >> we > >> >> >> > probably > >> >> >> > > > >>> need > >> >> >> > > > >>> > > another design document for that. To me it seems > like > >> a > >> >> >> bigger > >> >> >> > > > thing. > >> >> >> > > > >>> > > > >> >> >> > > > >>> > > Cheers, > >> >> >> > > > >>> > > Aljoscha > >> >> >> > > > >>> > > > >> >> >> > > > >>> > > On Thu, 7 Jul 2016 at 23:56 Vishnu Viswanath < > >> >> >> > > > >>> > vishnu.viswanat...@gmail.com > >> >> >> > > > >>> > > > > >> >> >> > > > >>> > > wrote: > >> >> >> > > > >>> > > > >> >> >> > > > >>> > > > Hi, > >> >> >> > > > >>> > > > > >> >> >> > > > >>> > > > Regarding the evictAfter function, that evicts > >> based on > >> >> >> some > >> >> >> > > > >>> decision > >> >> >> > > > >>> > > made > >> >> >> > > > >>> > > > by the window function: I think it will be nice > if > >> we > >> >> can > >> >> >> > come > >> >> >> > > > up > >> >> >> > > > >>> with > >> >> >> > > > >>> > > > something that is LESS coupled, because I can > think > >> of > >> >> >> > several > >> >> >> > > > use > >> >> >> > > > >>> > cases > >> >> >> > > > >>> > > > that depend on it. > >> >> >> > > > >>> > > > > >> >> >> > > > >>> > > > Especially in the case where there are late > arriving > >> >> >> > messages. > >> >> >> > > > Only > >> >> >> > > > >>> > after > >> >> >> > > > >>> > > > the window function is applied we could tell what > >> to do > >> >> >> with > >> >> >> > > the > >> >> >> > > > >>> > elements > >> >> >> > > > >>> > > > in the window. You could apply your business logic > >> >> there > >> >> >> to > >> >> >> > > > >>> determine > >> >> >> > > > >>> > if > >> >> >> > > > >>> > > > the window funciton was able to do what it is > >> supposed > >> >> to > >> >> >> do, > >> >> >> > > if > >> >> >> > > > >>> yes > >> >> >> > > > >>> > > evict > >> >> >> > > > >>> > > > those elements, else(since the elements you are > >> looking > >> >> >> for > >> >> >> > > > haven't > >> >> >> > > > >>> > > arrived > >> >> >> > > > >>> > > > yet) wait and try again when the trigger gets > fired > >> >> next > >> >> >> > time. > >> >> >> > > > >>> > > > > >> >> >> > > > >>> > > > Thanks and Regards, > >> >> >> > > > >>> > > > Vishnu Viswanath, > >> >> >> > > > >>> > > > > >> >> >> > > > >>> > > > > >> >> >> > > > >>> > > > On Thu, Jul 7, 2016 at 9:19 AM, Radu Tudoran < > >> >> >> > > > >>> radu.tudo...@huawei.com> > >> >> >> > > > >>> > > > wrote: > >> >> >> > > > >>> > > > > >> >> >> > > > >>> > > > > Hi, > >> >> >> > > > >>> > > > > > >> >> >> > > > >>> > > > > @Aljoscha - I can understand the reason why you > >> are > >> >> >> > hesitant > >> >> >> > > to > >> >> >> > > > >>> > > introduce > >> >> >> > > > >>> > > > > "slower" windows such as the ones that would > >> maintain > >> >> >> > sorted > >> >> >> > > > >>> items or > >> >> >> > > > >>> > > > > windows with bindings between the different > >> entities > >> >> >> > > (evictor, > >> >> >> > > > >>> > trigger, > >> >> >> > > > >>> > > > > window, apply function). However, I think it's > >> >> possible > >> >> >> > just > >> >> >> > > to > >> >> >> > > > >>> > create > >> >> >> > > > >>> > > > more > >> >> >> > > > >>> > > > > types of windows. The existing ones > (timewindows, > >> >> global > >> >> >> > > > windows > >> >> >> > > > >>> ...) > >> >> >> > > > >>> > > can > >> >> >> > > > >>> > > > > remain, and just add some more flavors of > windows > >> >> were > >> >> >> more > >> >> >> > > > >>> features > >> >> >> > > > >>> > > are > >> >> >> > > > >>> > > > > enabled or more functionality (e.g., access to > the > >> >> each > >> >> >> > > element > >> >> >> > > > >>> in > >> >> >> > > > >>> > the > >> >> >> > > > >>> > > > > evictor ; possibility to delete or mark for > >> eviction > >> >> >> > elements > >> >> >> > > > in > >> >> >> > > > >>> the > >> >> >> > > > >>> > > > > function...) > >> >> >> > > > >>> > > > > > >> >> >> > > > >>> > > > > Regarding the specific case of sorted windows, I > >> >> think > >> >> >> the > >> >> >> > N > >> >> >> > > > lon > >> >> >> > > > >>> N > >> >> >> > > > >>> > > > > complexity to sort (the worst case) is very > >> >> unlikely. In > >> >> >> > fact > >> >> >> > > > you > >> >> >> > > > >>> > have > >> >> >> > > > >>> > > > > almost sorted items/arrays. Moreover, if you > >> consider > >> >> >> that > >> >> >> > in > >> >> >> > > > >>> > > iteration X > >> >> >> > > > >>> > > > > all elements were sorted, then in iteration X+1 > >> you > >> >> will > >> >> >> > need > >> >> >> > > > to > >> >> >> > > > >>> sort > >> >> >> > > > >>> > > > just > >> >> >> > > > >>> > > > > the newly arrived elements (M). I would expect > >> that > >> >> this > >> >> >> > > > number M > >> >> >> > > > >>> > might > >> >> >> > > > >>> > > > be > >> >> >> > > > >>> > > > > significant smaller then N (elements that > exists). > >> >> Then > >> >> >> > using > >> >> >> > > > an > >> >> >> > > > >>> > > > insertion > >> >> >> > > > >>> > > > > sort for these new elements you would have M * > N > >> >> >> > complexity > >> >> >> > > > and > >> >> >> > > > >>> if > >> >> >> > > > >>> > > M<< N > >> >> >> > > > >>> > > > > then the complexity is O(N). Alternatively you > can > >> >> use a > >> >> >> > > binary > >> >> >> > > > >>> > search > >> >> >> > > > >>> > > > for > >> >> >> > > > >>> > > > > insertion and then you further reduce the > >> complexity > >> >> to > >> >> >> > > > O(logN). > >> >> >> > > > >>> > > > > If M is proportional to N then you can sort M > and > >> use > >> >> >> merge > >> >> >> > > > sort > >> >> >> > > > >>> for > >> >> >> > > > >>> > > > > combining. > >> >> >> > > > >>> > > > > > >> >> >> > > > >>> > > > > > >> >> >> > > > >>> > > > > Dr. Radu Tudoran > >> >> >> > > > >>> > > > > Research Engineer - Big Data Expert > >> >> >> > > > >>> > > > > IT R&D Division > >> >> >> > > > >>> > > > > > >> >> >> > > > >>> > > > > > >> >> >> > > > >>> > > > > HUAWEI TECHNOLOGIES Duesseldorf GmbH > >> >> >> > > > >>> > > > > European Research Center > >> >> >> > > > >>> > > > > Riesstrasse 25, 80992 München > >> >> >> > > > >>> > > > > > >> >> >> > > > >>> > > > > E-mail: radu.tudo...@huawei.com > >> >> >> > > > >>> > > > > Mobile: +49 15209084330 > >> >> >> > > > >>> > > > > Telephone: +49 891588344173 > >> >> >> > > > >>> > > > > > >> >> >> > > > >>> > > > > HUAWEI TECHNOLOGIES Duesseldorf GmbH > >> >> >> > > > >>> > > > > Hansaallee 205, 40549 Düsseldorf, Germany, > >> >> >> www.huawei.com > >> >> >> > > > >>> > > > > Registered Office: Düsseldorf, Register Court > >> >> >> Düsseldorf, > >> >> >> > HRB > >> >> >> > > > >>> 56063, > >> >> >> > > > >>> > > > > Managing Director: Bo PENG, Wanzhou MENG, Lifang > >> CHEN > >> >> >> > > > >>> > > > > Sitz der Gesellschaft: Düsseldorf, Amtsgericht > >> >> >> Düsseldorf, > >> >> >> > > HRB > >> >> >> > > > >>> 56063, > >> >> >> > > > >>> > > > > Geschäftsführer: Bo PENG, Wanzhou MENG, Lifang > >> CHEN > >> >> >> > > > >>> > > > > This e-mail and its attachments contain > >> confidential > >> >> >> > > > information > >> >> >> > > > >>> from > >> >> >> > > > >>> > > > > HUAWEI, which is intended only for the person or > >> >> entity > >> >> >> > whose > >> >> >> > > > >>> address > >> >> >> > > > >>> > > is > >> >> >> > > > >>> > > > > listed above. Any use of the information > contained > >> >> >> herein > >> >> >> > in > >> >> >> > > > any > >> >> >> > > > >>> way > >> >> >> > > > >>> > > > > (including, but not limited to, total or partial > >> >> >> > disclosure, > >> >> >> > > > >>> > > > reproduction, > >> >> >> > > > >>> > > > > or dissemination) by persons other than the > >> intended > >> >> >> > > > >>> recipient(s) is > >> >> >> > > > >>> > > > > prohibited. If you receive this e-mail in error, > >> >> please > >> >> >> > > notify > >> >> >> > > > >>> the > >> >> >> > > > >>> > > sender > >> >> >> > > > >>> > > > > by phone or email immediately and delete it! > >> >> >> > > > >>> > > > > > >> >> >> > > > >>> > > > > > >> >> >> > > > >>> > > > > -----Original Message----- > >> >> >> > > > >>> > > > > From: 吕文龙(吕文龙) [mailto: > >> wenlong....@alibaba-inc.com] > >> >> >> > > > >>> > > > > Sent: Thursday, July 07, 2016 11:59 AM > >> >> >> > > > >>> > > > > To: dev@flink.apache.org > >> >> >> > > > >>> > > > > Subject: 答复: [DISCUSS] Enhance Window Evictor in > >> >> Flink > >> >> >> > > > >>> > > > > > >> >> >> > > > >>> > > > > HI, > >> >> >> > > > >>> > > > > I think it is necessary to support sorted > window, > >> >> which > >> >> >> can > >> >> >> > > > avoid > >> >> >> > > > >>> > > > scanning > >> >> >> > > > >>> > > > > all the elements of window while trying to > >> evicting > >> >> >> > element, > >> >> >> > > > >>> which > >> >> >> > > > >>> > may > >> >> >> > > > >>> > > > cost > >> >> >> > > > >>> > > > > many IO operations, such as querying DBs to get > >> >> elements > >> >> >> > from > >> >> >> > > > >>> state. > >> >> >> > > > >>> > > > > What's more, when an window aggregation function > >> is > >> >> >> > > invertible, > >> >> >> > > > >>> such > >> >> >> > > > >>> > as > >> >> >> > > > >>> > > > > sum, which can be updated by adding or removing > a > >> >> single > >> >> >> > > > record, > >> >> >> > > > >>> > window > >> >> >> > > > >>> > > > > results can be incrementally calculated. In this > >> >> kind of > >> >> >> > > case, > >> >> >> > > > >>> we can > >> >> >> > > > >>> > > > > dramatically improve the performance of window > >> >> >> aggregation, > >> >> >> > > if > >> >> >> > > > >>> > evictor > >> >> >> > > > >>> > > > can > >> >> >> > > > >>> > > > > trigger update of window aggregation state by > some > >> >> >> > mechanism. > >> >> >> > > > >>> > > > > > >> >> >> > > > >>> > > > > Best Wishes! > >> >> >> > > > >>> > > > > --- > >> >> >> > > > >>> > > > > wenlong.lwl > >> >> >> > > > >>> > > > > > >> >> >> > > > >>> > > > > -----邮件原件----- > >> >> >> > > > >>> > > > > 发件人: Aljoscha Krettek [mailto: > aljos...@apache.org > >> ] > >> >> >> > > > >>> > > > > 发送时间: 2016年7月7日 17:32 > >> >> >> > > > >>> > > > > 收件人: dev@flink.apache.org > >> >> >> > > > >>> > > > > 主题: Re: [DISCUSS] Enhance Window Evictor in > Flink > >> >> >> > > > >>> > > > > > >> >> >> > > > >>> > > > > Hi, > >> >> >> > > > >>> > > > > regarding "sorting the window by event time": I > >> also > >> >> >> > > considered > >> >> >> > > > >>> this > >> >> >> > > > >>> > > but > >> >> >> > > > >>> > > > > in the end I don't think it's necessary. Sorting > >> is > >> >> >> rather > >> >> >> > > > >>> expensive > >> >> >> > > > >>> > > and > >> >> >> > > > >>> > > > > making decisions based on the order of elements > >> can > >> >> be > >> >> >> > > tricky. > >> >> >> > > > An > >> >> >> > > > >>> > > extreme > >> >> >> > > > >>> > > > > example of why this can be problematic is the > case > >> >> where > >> >> >> > all > >> >> >> > > > >>> elements > >> >> >> > > > >>> > > in > >> >> >> > > > >>> > > > > the window have the same timestamp. Now, if you > >> >> decide > >> >> >> to > >> >> >> > > evict > >> >> >> > > > >>> the > >> >> >> > > > >>> > > > first 5 > >> >> >> > > > >>> > > > > elements based on timestamp order you basically > >> >> >> arbitrarily > >> >> >> > > > >>> evict 5 > >> >> >> > > > >>> > > > > elements. I think the better solution for doing > >> >> >> time-based > >> >> >> > > > >>> eviction > >> >> >> > > > >>> > is > >> >> >> > > > >>> > > to > >> >> >> > > > >>> > > > > do one pass over the elements to get an overview > >> of > >> >> the > >> >> >> > > > timestamp > >> >> >> > > > >>> > > > > distribution, then do a second pass and evict > >> >> elements > >> >> >> > based > >> >> >> > > on > >> >> >> > > > >>> what > >> >> >> > > > >>> > > was > >> >> >> > > > >>> > > > > learned in the first pass. This has complexity > 2*n > >> >> >> compared > >> >> >> > > to > >> >> >> > > > >>> the > >> >> >> > > > >>> > > n*log > >> >> >> > > > >>> > > > n > >> >> >> > > > >>> > > > > (plus the work of actually deciding what to > >> evict) of > >> >> >> the > >> >> >> > > sort > >> >> >> > > > >>> based > >> >> >> > > > >>> > > > > strategy. > >> >> >> > > > >>> > > > > > >> >> >> > > > >>> > > > > I might be wrong, though, and there could be a > >> valid > >> >> >> > use-case > >> >> >> > > > not > >> >> >> > > > >>> > > covered > >> >> >> > > > >>> > > > > by the above idea. > >> >> >> > > > >>> > > > > > >> >> >> > > > >>> > > > > regarding Vishnu's other use case of evicting > >> based > >> >> on > >> >> >> some > >> >> >> > > > >>> decision > >> >> >> > > > >>> > in > >> >> >> > > > >>> > > > the > >> >> >> > > > >>> > > > > WindowFunction: could this be solved by doing > the > >> >> check > >> >> >> for > >> >> >> > > the > >> >> >> > > > >>> > pattern > >> >> >> > > > >>> > > > in > >> >> >> > > > >>> > > > > the evictor itself instead of in the window > >> function? > >> >> >> I'm > >> >> >> > > very > >> >> >> > > > >>> > hesitant > >> >> >> > > > >>> > > > to > >> >> >> > > > >>> > > > > introduce a coupling between the different > >> >> components of > >> >> >> > the > >> >> >> > > > >>> > windowing > >> >> >> > > > >>> > > > > system, i.e. assigner, trigger, evictor and > window > >> >> >> > function. > >> >> >> > > > The > >> >> >> > > > >>> > reason > >> >> >> > > > >>> > > > is > >> >> >> > > > >>> > > > > that using an evictor has a huge performance > >> impact > >> >> >> since > >> >> >> > the > >> >> >> > > > >>> system > >> >> >> > > > >>> > > > always > >> >> >> > > > >>> > > > > has to keep all elements and cannot to > incremental > >> >> >> > > aggregation > >> >> >> > > > of > >> >> >> > > > >>> > > window > >> >> >> > > > >>> > > > > results and I therefore don't want to put > specific > >> >> >> features > >> >> >> > > > >>> regarding > >> >> >> > > > >>> > > > > eviction into the other components. > >> >> >> > > > >>> > > > > > >> >> >> > > > >>> > > > > Cheers, > >> >> >> > > > >>> > > > > Aljoscha > >> >> >> > > > >>> > > > > > >> >> >> > > > >>> > > > > On Thu, 7 Jul 2016 at 10:00 Radu Tudoran < > >> >> >> > > > >>> radu.tudo...@huawei.com> > >> >> >> > > > >>> > > > wrote: > >> >> >> > > > >>> > > > > > >> >> >> > > > >>> > > > > > Hi, > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > I think the situation Vishnu raised is > something > >> >> that > >> >> >> > > should > >> >> >> > > > be > >> >> >> > > > >>> > > > > accounted. > >> >> >> > > > >>> > > > > > It can happen indeed that you want to > condition > >> >> what > >> >> >> you > >> >> >> > > > evict > >> >> >> > > > >>> from > >> >> >> > > > >>> > > > > > the window based on the result of the function > >> to > >> >> be > >> >> >> > > applied. > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > My 2 cents... > >> >> >> > > > >>> > > > > > I would suggest adding a list for the elements > >> of > >> >> the > >> >> >> > > stream > >> >> >> > > > >>> where > >> >> >> > > > >>> > > you > >> >> >> > > > >>> > > > > > can MARK them to be delete. Alternatively the > >> >> iterator > >> >> >> > can > >> >> >> > > be > >> >> >> > > > >>> > > extended > >> >> >> > > > >>> > > > > > to have a function > >> Iterator.markForEviction(int); > >> >> >> These > >> >> >> > can > >> >> >> > > > be > >> >> >> > > > >>> made > >> >> >> > > > >>> > > > > > available also in the apply function. > Moreover, > >> we > >> >> can > >> >> >> > use > >> >> >> > > > >>> this to > >> >> >> > > > >>> > > > > > extend the functionality such that you add > MARKs > >> >> >> either > >> >> >> > for > >> >> >> > > > >>> > eviction > >> >> >> > > > >>> > > > > > after the function has finished triggering or > >> to be > >> >> >> > evicted > >> >> >> > > > in > >> >> >> > > > >>> the > >> >> >> > > > >>> > > next > >> >> >> > > > >>> > > > > iteration. > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > Dr. Radu Tudoran > >> >> >> > > > >>> > > > > > Research Engineer - Big Data Expert > >> >> >> > > > >>> > > > > > IT R&D Division > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > HUAWEI TECHNOLOGIES Duesseldorf GmbH > >> >> >> > > > >>> > > > > > European Research Center > >> >> >> > > > >>> > > > > > Riesstrasse 25, 80992 München > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > E-mail: radu.tudo...@huawei.com > >> >> >> > > > >>> > > > > > Mobile: +49 15209084330 > >> >> >> > > > >>> > > > > > Telephone: +49 891588344173 > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > HUAWEI TECHNOLOGIES Duesseldorf GmbH > >> >> >> > > > >>> > > > > > Hansaallee 205, 40549 Düsseldorf, Germany, > >> >> >> > www.huawei.com > >> >> >> > > > >>> > Registered > >> >> >> > > > >>> > > > > > Office: Düsseldorf, Register Court Düsseldorf, > >> HRB > >> >> >> 56063, > >> >> >> > > > >>> Managing > >> >> >> > > > >>> > > > > > Director: Bo PENG, Wanzhou MENG, Lifang CHEN > >> Sitz > >> >> der > >> >> >> > > > >>> Gesellschaft: > >> >> >> > > > >>> > > > > > Düsseldorf, Amtsgericht Düsseldorf, HRB 56063, > >> >> >> > > > >>> > > > > > Geschäftsführer: Bo PENG, Wanzhou MENG, Lifang > >> CHEN > >> >> >> This > >> >> >> > > > >>> e-mail and > >> >> >> > > > >>> > > > > > its attachments contain confidential > information > >> >> from > >> >> >> > > HUAWEI, > >> >> >> > > > >>> which > >> >> >> > > > >>> > > is > >> >> >> > > > >>> > > > > > intended only for the person or entity whose > >> >> address > >> >> >> is > >> >> >> > > > listed > >> >> >> > > > >>> > above. > >> >> >> > > > >>> > > > > > Any use of the information contained herein in > >> any > >> >> way > >> >> >> > > > >>> (including, > >> >> >> > > > >>> > > but > >> >> >> > > > >>> > > > > > not limited to, total or partial disclosure, > >> >> >> > reproduction, > >> >> >> > > or > >> >> >> > > > >>> > > > > > dissemination) by persons other than the > >> intended > >> >> >> > > > recipient(s) > >> >> >> > > > >>> is > >> >> >> > > > >>> > > > > > prohibited. If you receive this e-mail in > error, > >> >> >> please > >> >> >> > > > notify > >> >> >> > > > >>> the > >> >> >> > > > >>> > > > > > sender by phone or email immediately and > delete > >> it! > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > -----Original Message----- > >> >> >> > > > >>> > > > > > From: Vishnu Viswanath [mailto: > >> >> >> > > vishnu.viswanat...@gmail.com] > >> >> >> > > > >>> > > > > > Sent: Thursday, July 07, 2016 1:28 AM > >> >> >> > > > >>> > > > > > To: Dev > >> >> >> > > > >>> > > > > > Subject: Re: [DISCUSS] Enhance Window Evictor > in > >> >> Flink > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > Thank you Maxim and Aljoscha. > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > Yes the beforeEvict and afterEvict should able > >> >> address > >> >> >> > > point > >> >> >> > > > 3. > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > I have one more use case in my mind (which I > >> might > >> >> >> have > >> >> >> > to > >> >> >> > > do > >> >> >> > > > >>> in > >> >> >> > > > >>> > the > >> >> >> > > > >>> > > > > > later stages of POC). > >> >> >> > > > >>> > > > > > What if the `evictAfter` should behave > >> differently > >> >> >> based > >> >> >> > on > >> >> >> > > > the > >> >> >> > > > >>> > > window > >> >> >> > > > >>> > > > > > function. > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > For example. > >> >> >> > > > >>> > > > > > I have a window that got triggered and my > evict > >> >> >> function > >> >> >> > is > >> >> >> > > > >>> being > >> >> >> > > > >>> > > > > > called after the apply function. In such > cases I > >> >> >> should > >> >> >> > be > >> >> >> > > > >>> able to > >> >> >> > > > >>> > > > > > decide on what I should evict based on the > >> window > >> >> >> > function. > >> >> >> > > > >>> > > > > > e.g., > >> >> >> > > > >>> > > > > > let the window have elements of type `case > class > >> >> >> Item(id: > >> >> >> > > > >>> String, > >> >> >> > > > >>> > > type: > >> >> >> > > > >>> > > > > > String)` and let the types be `type1` and > >> `type2`. > >> >> >> > > > >>> > > > > > If window function is able to find a sequence > : > >> >> `type1 > >> >> >> > > type2 > >> >> >> > > > >>> > type1`, > >> >> >> > > > >>> > > > > > then evict all elements of the type type2. > >> >> >> > > > >>> > > > > > or if the window function is able to find a > >> >> sequence > >> >> >> > `type2 > >> >> >> > > > >>> type2 > >> >> >> > > > >>> > > > > > type1`, then evict all elements of type type1 > >> else > >> >> >> don't > >> >> >> > > > evict > >> >> >> > > > >>> any > >> >> >> > > > >>> > > > > elements. > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > Is this possible? or at least let the window > >> >> function > >> >> >> > > choose > >> >> >> > > > >>> > between > >> >> >> > > > >>> > > > > > two Evictor functions -(one for success case > and > >> >> one > >> >> >> > > failure > >> >> >> > > > >>> case) > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > @Maxim: > >> >> >> > > > >>> > > > > > regarding the sorted window, actually I wanted > >> my > >> >> >> > elements > >> >> >> > > to > >> >> >> > > > >>> be > >> >> >> > > > >>> > > > > > sorted but not for the eviction but while > >> applying > >> >> the > >> >> >> > > window > >> >> >> > > > >>> > > function > >> >> >> > > > >>> > > > > > (so thought this could be done easily). But it > >> >> would > >> >> >> be > >> >> >> > > good > >> >> >> > > > to > >> >> >> > > > >>> > have > >> >> >> > > > >>> > > > > > the window sorted based on EventTime. > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > Thanks and Regards, > >> >> >> > > > >>> > > > > > Vishnu Viswanath, > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > On Wed, Jul 6, 2016 at 3:55 PM, Maxim < > >> >> >> mfat...@gmail.com > >> >> >> > > > >> >> >> > > > >>> wrote: > >> >> >> > > > >>> > > > > > > >> >> >> > > > >>> > > > > > > Actually for such evictor to be useful the > >> window > >> >> >> > should > >> >> >> > > be > >> >> >> > > > >>> > sorted > >> >> >> > > > >>> > > > > > > by some field, usually event time. What do > you > >> >> think > >> >> >> > > about > >> >> >> > > > >>> adding > >> >> >> > > > >>> > > > > > > sorted window abstraction? > >> >> >> > > > >>> > > > > > > > >> >> >> > > > >>> > > > > > > On Wed, Jul 6, 2016 at 11:36 AM, Aljoscha > >> Krettek > >> >> >> > > > >>> > > > > > > <aljos...@apache.org> > >> >> >> > > > >>> > > > > > > wrote: > >> >> >> > > > >>> > > > > > > > >> >> >> > > > >>> > > > > > > > @Maxim: That's perfect I didn't think > about > >> >> using > >> >> >> > > > >>> > > > > > > > Iterator.remove() for that. I'll update > the > >> >> doc. > >> >> >> What > >> >> >> > > do > >> >> >> > > > >>> you > >> >> >> > > > >>> > > think > >> >> >> > > > >>> > > > > > > > Vishnu? This should also > >> >> >> > > > >>> > > > > > > cover > >> >> >> > > > >>> > > > > > > > your before/after case nicely. > >> >> >> > > > >>> > > > > > > > > >> >> >> > > > >>> > > > > > > > @Vishnu: The steps would be these: > >> >> >> > > > >>> > > > > > > > - Converge on a design in this discussion > >> >> >> > > > >>> > > > > > > > - Add a Jira issue here: > >> >> >> > > > >>> > > > > > > > > https://issues.apache.org/jira/browse/FLINK > >> >> >> > > > >>> > > > > > > > - Work on the code an create a pull > >> request on > >> >> >> > github > >> >> >> > > > >>> > > > > > > > > >> >> >> > > > >>> > > > > > > > The steps are also outlined here > >> >> >> > > > >>> > > > > > > > > >> http://flink.apache.org/how-to-contribute.html > >> >> >> and > >> >> >> > > here > >> >> >> > > > >>> > > > > > > > > >> http://flink.apache.org/contribute-code.html. > >> >> >> > > > >>> > > > > > > > > >> >> >> > > > >>> > > > > > > > - > >> >> >> > > > >>> > > > > > > > Aljoscha > >> >> >> > > > >>> > > > > > > > > >> >> >> > > > >>> > > > > > > > On Wed, 6 Jul 2016 at 19:45 Maxim < > >> >> >> mfat...@gmail.com > >> >> >> > > > >> >> >> > > > >>> wrote: > >> >> >> > > > >>> > > > > > > > > >> >> >> > > > >>> > > > > > > > > The new API forces iteration through > every > >> >> >> element > >> >> >> > of > >> >> >> > > > the > >> >> >> > > > >>> > > buffer > >> >> >> > > > >>> > > > > > > > > even > >> >> >> > > > >>> > > > > > > if > >> >> >> > > > >>> > > > > > > > a > >> >> >> > > > >>> > > > > > > > > single value to be evicted. What about > >> >> >> implementing > >> >> >> > > > >>> > > > > > > > > Iterator.remove() method for elements? > The > >> >> API > >> >> >> > would > >> >> >> > > > look > >> >> >> > > > >>> > like: > >> >> >> > > > >>> > > > > > > > > > >> >> >> > > > >>> > > > > > > > > public interface Evictor<T, W extends > >> Window> > >> >> >> > extends > >> >> >> > > > >>> > > > > > > > > Serializable { > >> >> >> > > > >>> > > > > > > > > > >> >> >> > > > >>> > > > > > > > > /** > >> >> >> > > > >>> > > > > > > > > * Optionally evicts elements. > Called > >> >> before > >> >> >> > > > >>> windowing > >> >> >> > > > >>> > > > > function. > >> >> >> > > > >>> > > > > > > > > * > >> >> >> > > > >>> > > > > > > > > * @param elements The elements > >> currently > >> >> in > >> >> >> the > >> >> >> > > > >>> pane. Use > >> >> >> > > > >>> > > > > > > > > Iterator.remove to evict. > >> >> >> > > > >>> > > > > > > > > * @param size The current number of > >> >> >> elements in > >> >> >> > > the > >> >> >> > > > >>> pane. > >> >> >> > > > >>> > > > > > > > > * @param window The {@link Window} > >> >> >> > > > >>> > > > > > > > > */ > >> >> >> > > > >>> > > > > > > > > void evictBefore(Iterable<T> > elements, > >> int > >> >> >> size, > >> >> >> > > > >>> > > > > > > > > EvictorContext > >> >> >> > > > >>> > > > > > > ctx); > >> >> >> > > > >>> > > > > > > > > > >> >> >> > > > >>> > > > > > > > > /** > >> >> >> > > > >>> > > > > > > > > * Optionally evicts elements. > Called > >> >> after > >> >> >> > > > windowing > >> >> >> > > > >>> > > > function. > >> >> >> > > > >>> > > > > > > > > * > >> >> >> > > > >>> > > > > > > > > * @param elements The elements > >> currently > >> >> in > >> >> >> the > >> >> >> > > > >>> pane. Use > >> >> >> > > > >>> > > > > > > > > Iterator.remove to evict. > >> >> >> > > > >>> > > > > > > > > * @param size The current number of > >> >> >> elements in > >> >> >> > > the > >> >> >> > > > >>> pane. > >> >> >> > > > >>> > > > > > > > > * @param window The {@link Window} > >> >> >> > > > >>> > > > > > > > > */ > >> >> >> > > > >>> > > > > > > > > void evictAfter(Iterable<T> elements, > >> int > >> >> >> size, > >> >> >> > > > >>> > > > > > > > > EvictorContext ctx); } > >> >> >> > > > >>> > > > > > > > > > >> >> >> > > > >>> > > > > > > > > Such API allows to abort iteration at > any > >> >> point > >> >> >> and > >> >> >> > > > evict > >> >> >> > > > >>> > > > > > > > > elements in > >> >> >> > > > >>> > > > > > > any > >> >> >> > > > >>> > > > > > > > > order. > >> >> >> > > > >>> > > > > > > > > > >> >> >> > > > >>> > > > > > > > > Thanks, > >> >> >> > > > >>> > > > > > > > > > >> >> >> > > > >>> > > > > > > > > Maxim. > >> >> >> > > > >>> > > > > > > > > > >> >> >> > > > >>> > > > > > > > > On Wed, Jul 6, 2016 at 9:04 AM, Vishnu > >> >> >> Viswanath < > >> >> >> > > > >>> > > > > > > > > vishnu.viswanat...@gmail.com> wrote: > >> >> >> > > > >>> > > > > > > > > > > >> >> >> > > > >>> > > > > > > > > > Hi Aljoscha, > >> >> >> > > > >>> > > > > > > > > > > >> >> >> > > > >>> > > > > > > > > > Thanks. Yes the new interface seems to > >> >> address > >> >> >> > > points > >> >> >> > > > >>> 1 and > >> >> >> > > > >>> > > 2. > >> >> >> > > > >>> > > > > > > > > > of > >> >> >> > > > >>> > > > > > > > > > > >> >> >> > > > >>> > > > > > > > > > *1) I am having a use case where I > have > >> to > >> >> >> > create a > >> >> >> > > > >>> custom > >> >> >> > > > >>> > > > > > > > > > Evictor > >> >> >> > > > >>> > > > > > > that > >> >> >> > > > >>> > > > > > > > > > will evict elements from the window > >> based > >> >> on > >> >> >> the > >> >> >> > > > value > >> >> >> > > > >>> > (e.g., > >> >> >> > > > >>> > > > > > > > > > if I > >> >> >> > > > >>> > > > > > > have > >> >> >> > > > >>> > > > > > > > > > elements are of case class Item(id: > Int, > >> >> >> > > type:String) > >> >> >> > > > >>> then > >> >> >> > > > >>> > > > > > > > > > evict > >> >> >> > > > >>> > > > > > > > elements > >> >> >> > > > >>> > > > > > > > > > that has type="a"). I believe this is > >> not > >> >> >> > currently > >> >> >> > > > >>> > > possible.* > >> >> >> > > > >>> > > > > > > > > > *2) this is somewhat related to 1) > where > >> >> there > >> >> >> > > should > >> >> >> > > > >>> be an > >> >> >> > > > >>> > > > > > > > > > option to > >> >> >> > > > >>> > > > > > > > > evict > >> >> >> > > > >>> > > > > > > > > > elements from anywhere in the window. > >> not > >> >> only > >> >> >> > from > >> >> >> > > > the > >> >> >> > > > >>> > > > > > > > > > beginning of > >> >> >> > > > >>> > > > > > > > the > >> >> >> > > > >>> > > > > > > > > > window. (e.g., apply the delta > function > >> to > >> >> all > >> >> >> > > > >>> elements and > >> >> >> > > > >>> > > > > > > > > > remove > >> >> >> > > > >>> > > > > > > all > >> >> >> > > > >>> > > > > > > > > > those don't pass. I checked the code > and > >> >> evict > >> >> >> > > method > >> >> >> > > > >>> just > >> >> >> > > > >>> > > > > > > > > > returns > >> >> >> > > > >>> > > > > > > the > >> >> >> > > > >>> > > > > > > > > > number of elements to be removed and > >> >> >> > > > >>> processTriggerResult > >> >> >> > > > >>> > > just > >> >> >> > > > >>> > > > > > > > > > skips > >> >> >> > > > >>> > > > > > > > > those > >> >> >> > > > >>> > > > > > > > > > many elements from the beginning. * > >> >> >> > > > >>> > > > > > > > > > *3) Add an option to enables the user > to > >> >> >> decide > >> >> >> > if > >> >> >> > > > the > >> >> >> > > > >>> > > > > > > > > > eviction > >> >> >> > > > >>> > > > > > > should > >> >> >> > > > >>> > > > > > > > > > happen before the apply function or > >> after > >> >> the > >> >> >> > apply > >> >> >> > > > >>> > function. > >> >> >> > > > >>> > > > > > > Currently > >> >> >> > > > >>> > > > > > > > > it > >> >> >> > > > >>> > > > > > > > > > is before the apply function, but I > >> have a > >> >> use > >> >> >> > case > >> >> >> > > > >>> where I > >> >> >> > > > >>> > > > > > > > > > need to > >> >> >> > > > >>> > > > > > > > first > >> >> >> > > > >>> > > > > > > > > > apply the function and evict > afterward.* > >> >> >> > > > >>> > > > > > > > > > > >> >> >> > > > >>> > > > > > > > > > I would be interested in contributing > to > >> >> the > >> >> >> code > >> >> >> > > > base. > >> >> >> > > > >>> > > Please > >> >> >> > > > >>> > > > > > > > > > let me > >> >> >> > > > >>> > > > > > > > > know > >> >> >> > > > >>> > > > > > > > > > the steps. > >> >> >> > > > >>> > > > > > > > > > > >> >> >> > > > >>> > > > > > > > > > Thanks and Regards, > >> >> >> > > > >>> > > > > > > > > > Vishnu Viswanath > >> >> >> > > > >>> > > > > > > > > > > >> >> >> > > > >>> > > > > > > > > > On Wed, Jul 6, 2016 at 11:49 AM, > >> Aljoscha > >> >> >> > Krettek < > >> >> >> > > > >>> > > > > > > aljos...@apache.org > >> >> >> > > > >>> > > > > > > > > > >> >> >> > > > >>> > > > > > > > > > wrote: > >> >> >> > > > >>> > > > > > > > > > > >> >> >> > > > >>> > > > > > > > > > > Hi, > >> >> >> > > > >>> > > > > > > > > > > as mentioned in the thread on > >> improving > >> >> the > >> >> >> > > > Windowing > >> >> >> > > > >>> > API I > >> >> >> > > > >>> > > > > > > > > > > also > >> >> >> > > > >>> > > > > > > > have a > >> >> >> > > > >>> > > > > > > > > > > design doc just for improving > >> >> >> WindowEvictors. I > >> >> >> > > had > >> >> >> > > > >>> this > >> >> >> > > > >>> > in > >> >> >> > > > >>> > > > > > > > > > > my head > >> >> >> > > > >>> > > > > > > > for > >> >> >> > > > >>> > > > > > > > > a > >> >> >> > > > >>> > > > > > > > > > > while but was hesitant to publish > but > >> >> since > >> >> >> > > people > >> >> >> > > > >>> are > >> >> >> > > > >>> > > > > > > > > > > asking about > >> >> >> > > > >>> > > > > > > > > this > >> >> >> > > > >>> > > > > > > > > > > now might be a good time to post it. > >> >> Here's > >> >> >> the > >> >> >> > > > doc: > >> >> >> > > > >>> > > > > > > > > > > > >> >> >> > > > >>> > > > > > > > > > >> >> >> > > > >>> > > > > > > > > > >> >> >> > > > >>> > > > > > > > > >> >> >> > > > >>> > > > > > > > >> >> >> > > > >>> > > > >> >> >> > > > > >> >> >> > >> https://docs.google.com/document/d/1Rr7xzlItYqvFXLyyy-Yv0vvw8f29QYAj > >> >> >> > > > >>> > > > > > > m5 > >> >> >> > > > >>> > > > > > > i9E4A_JlU/edit?usp=sharing > >> >> >> > > > >>> > > > > > > > > > > > >> >> >> > > > >>> > > > > > > > > > > Feedback/Suggestions are very > welcome! > >> >> >> Please > >> >> >> > let > >> >> >> > > > me > >> >> >> > > > >>> know > >> >> >> > > > >>> > > > > > > > > > > what you > >> >> >> > > > >>> > > > > > > > > think. > >> >> >> > > > >>> > > > > > > > > > > > >> >> >> > > > >>> > > > > > > > > > > @Vishnu: Are you interested in > >> >> contributing > >> >> >> a > >> >> >> > > > >>> solution > >> >> >> > > > >>> > for > >> >> >> > > > >>> > > > > > > > > > > this to > >> >> >> > > > >>> > > > > > > > the > >> >> >> > > > >>> > > > > > > > > > > Flink code base? I'd be very happy > to > >> >> work > >> >> >> with > >> >> >> > > you > >> >> >> > > > >>> on > >> >> >> > > > >>> > > this. > >> >> >> > > > >>> > > > > > > > > > > > >> >> >> > > > >>> > > > > > > > > > > Cheers, > >> >> >> > > > >>> > > > > > > > > > > Aljoscha > >> >> >> > > > >>> > > > > > > > > > > > >> >> >> > > > >>> > > > > > > > > > > P.S. I think it would be best to > keep > >> >> >> > discussions > >> >> >> > > > to > >> >> >> > > > >>> the > >> >> >> > > > >>> > ML > >> >> >> > > > >>> > > > > > > > > > > because comments on the doc will not > >> be > >> >> >> visible > >> >> >> > > > here > >> >> >> > > > >>> for > >> >> >> > > > >>> > > > > > everyone. > >> >> >> > > > >>> > > > > > > > > > > > >> >> >> > > > >>> > > > > > > > > > >> >> >> > > > >>> > > > > > >