I think that roughly, an approach like the compacting hash table is the
right one.
Go ahead and take a stab at it, if you want, ping us if you run into
obstacles.
Here are a few thoughts on the hash-aggregator from discussions between
Fabian and me:
1) It may be worth to have a specialized implem
Hello,
I would really like to see FLINK-2237 solved.
I would implement this feature over the weekend, if the
CompactingHashTable can be used to solve it (see my comment there).
Could you please give me some advice on whether is this a viable
approach, or you perhaps see some difficulties that I'm
gt; 2015-06-17 22:41 GMT+02:00 :
>
> >
> > Hello dear Developer,
> > Currently aggregation functions are implemented based on sorting. We
> would
> > like to add hash based aggregation to Flink. We would be thankful if you
> > could tell as how to get started (shall we a
I added a comment with suggestions how to proceed in the JIRA issue.
2015-06-17 22:41 GMT+02:00 :
>
> Hello dear Developer,
> Currently aggregation functions are implemented based on sorting. We would
> like to add hash based aggregation to Flink. We would be thankful if you
> co
Hello dear Developer,
Currently aggregation functions are implemented based on sorting. We
would like to add hash based aggregation to Flink. We would be
thankful if you could tell as how to get started (shall we add it as
an operator, runtime or ...). Our current thinking is to first hash
Rafiullah Momand created FLINK-2237:
---
Summary: Add hash-based Aggregation
Key: FLINK-2237
URL: https://issues.apache.org/jira/browse/FLINK-2237
Project: Flink
Issue Type: New Feature