Hi thnx for the feedback. So there is a limitation due to parallel windows implementation. No intentions to change that somehow to accommodate similar estimations?
WindowAll in practice is used as step in the pipeline? I mean since its inherently not parallel cannot scale correct? Although there is an exception: "Only for special cases, such as aligned time windows is it possible to perform this operation in parallel" Probably missing something... I could try do the example stuff (and open a new feature on jira for that). I will also vote for closing the old issue too since there is no other way at least for the time being... Thanx, Stavros On Fri, May 20, 2016 at 7:02 PM, Aljoscha Krettek <aljos...@apache.org> wrote: > Hi, > with how the window API currently works this can only be done for > non-parallel windows. For keyed windows everything that happens is scoped > to the key of the elements: window contents are kept in per-key state, > triggers fire on a per-key basis. Therefore a count-min sketch cannot be > used because it would require to keep state across keys. > > For non-parallel windows a user could do this: > > DataStream input = ... > input > .windowAll(<some window>) > .fold(new MySketch(), new MySketchFoldFunction()) > > with sketch data types and a fold function that is tailored to the user > types. Therefore, I would prefer to not add a special API for this and vote > to close https://issues.apache.org/jira/browse/FLINK-2147. I already > commented on https://issues.apache.org/jira/browse/FLINK-2144, saying a > similar thing. > > What I would welcome very much is to add some well documented examples to > Flink that showcase how some of these operations can be written. > > Cheers, > Aljoscha > > On Thu, 19 May 2016 at 16:38 Stavros Kontopoulos <st.kontopou...@gmail.com > > > wrote: > > > Hi guys, > > > > I would like to push forward the work here: > > https://issues.apache.org/jira/browse/FLINK-2147 > > > > Can anyone more familiar with streaming api verify if this could be a > > mature task. > > The intention is to summarize data over a window like in the case of > > StreamGroupedFold. > > Specifically implement count min in a window. > > > > Best, > > Stavros > > >