Re: User-defined aggregation function and parallelism

2018-04-16 Thread Fabian Hueske
Hi Bill, Flink's built-in aggregation functions are implemented against the same interface as UDAGGs and are applied in parallel. The performance depends, of course, on the implementation of the UDAGG. For example, you should try to keep the size of the accumulator as small as possible because it wi
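
To illustrate the point about accumulator size, here is a minimal sketch, assuming a weighted-average aggregate. It is plain Java and only mirrors the createAccumulator / accumulate / merge / getValue method shape of Flink's AggregateFunction; it does not extend the Flink class, and the names WeightedAvgSketch and Acc are hypothetical. The accumulator holds two longs no matter how many rows are aggregated, rather than buffering every input value in a list.

```java
// Plain-Java sketch (no Flink dependency); class and field names are hypothetical.
class WeightedAvgSketch {

    // Fixed-size accumulator: two longs, independent of the number of rows seen.
    static class Acc {
        long sum;
        long count;
    }

    Acc createAccumulator() {
        return new Acc();
    }

    // Fold one input row into the accumulator.
    void accumulate(Acc acc, long value, int weight) {
        acc.sum += value * weight;
        acc.count += weight;
    }

    // Combine partial accumulators into acc.
    void merge(Acc acc, Iterable<Acc> others) {
        for (Acc other : others) {
            acc.sum += other.sum;
            acc.count += other.count;
        }
    }

    // Final result; null when nothing has been accumulated.
    Long getValue(Acc acc) {
        if (acc.count == 0) {
            return null;
        }
        return acc.sum / acc.count; // integer division
    }

    public static void main(String[] args) {
        WeightedAvgSketch f = new WeightedAvgSketch();
        Acc acc = f.createAccumulator();
        f.accumulate(acc, 10L, 1);
        f.accumulate(acc, 20L, 3);
        System.out.println(f.getValue(acc)); // prints 17
    }
}
```

An accumulator that instead collected every value (for example in a List) would grow with the input, which is the case the advice above warns against.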

User-defined aggregation function and parallelism

2018-04-15 Thread 杨力
I am running Flink SQL in streaming mode and have implemented a UDAGG, which is used in keyed HOP windows. However, I found that throughput decreases dramatically when the function is used. Does a UDAGG run in parallel, or does it run in only one thread? Regards, Bill