Hello dear Developer,
Currently, aggregation functions in Flink are implemented using sorting. We would like to add hash-based aggregation to Flink, and we would be thankful if you could tell us how to get started (for example, should we add it as an operator, in the runtime, or ...). Our current thinking is to first hash-partition the data on the grouping attribute and then apply the aggregation function to the records that fall into the same bucket. To do this, we need to understand the workflow of aggregation in Flink.
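
To make the idea concrete, here is a minimal sketch of the two steps described above, written in plain Java and deliberately not against any Flink API. The class and method names (HashAggregationSketch, partitionByKey, aggregatePartition, sumByKey) are hypothetical and chosen only for illustration, and SUM over (String key, Integer value) records is just an assumed example aggregate.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only: none of these names come from Flink.
class HashAggregationSketch {

    // Step 1: route each (key, value) record to a bucket chosen by the hash
    // of its grouping key, so all records with the same key share a bucket.
    static List<List<Map.Entry<String, Integer>>> partitionByKey(
            List<Map.Entry<String, Integer>> records, int numPartitions) {
        List<List<Map.Entry<String, Integer>>> partitions = new ArrayList<>();
        for (int i = 0; i < numPartitions; i++) {
            partitions.add(new ArrayList<>());
        }
        for (Map.Entry<String, Integer> record : records) {
            // floorMod keeps the index non-negative for negative hash codes.
            int index = Math.floorMod(record.getKey().hashCode(), numPartitions);
            partitions.get(index).add(record);
        }
        return partitions;
    }

    // Step 2: inside one bucket, keep a running sum per key in a hash table.
    // No sorting of the input is needed.
    static Map<String, Long> aggregatePartition(
            List<Map.Entry<String, Integer>> partition) {
        Map<String, Long> sums = new HashMap<>();
        for (Map.Entry<String, Integer> record : partition) {
            sums.merge(record.getKey(), record.getValue().longValue(), Long::sum);
        }
        return sums;
    }

    // Runs both steps and collects the per-bucket results. Buckets hold
    // disjoint key sets, so the partial results can simply be combined.
    static Map<String, Long> sumByKey(
            List<Map.Entry<String, Integer>> records, int numPartitions) {
        Map<String, Long> result = new HashMap<>();
        for (List<Map.Entry<String, Integer>> partition
                : partitionByKey(records, numPartitions)) {
            result.putAll(aggregatePartition(partition));
        }
        return result;
    }

    public static void main(String[] args) {
        List<Map.Entry<String, Integer>> records = List.of(
                Map.entry("a", 1), Map.entry("b", 2), Map.entry("a", 3));
        System.out.println(sumByKey(records, 4));
    }
}
```

In this sketch the buckets are processed one after another in a single process; in a distributed setting each bucket would presumably be handled by a separate parallel task, which is the part we need guidance on.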

With Respect,
Rafiullah Momand
