Re: Dataset statistics

2019-02-19 Thread Flavio Pompermaier
We've just published a first attempt (on Flink 1.6.2) that extract some descriptive statistics from a batch dataset[1]. Any feedback is welcome. Best, Flavio [1] https://github.com/okkam-it/flink-descriptive-stats On Thu, Feb 14, 2019 at 11:19 AM Flavio Pompermaier wrote: > No effort in this d

Re: Dataset statistics

2019-02-14 Thread Flavio Pompermaier
No effort in this direction, then? I had a try using SQL on Table API but I fear that the generated plan is not the optimal one..I'm looking for an efficient way to implement describe() method on a table or dataset/datasource On Fri, Feb 8, 2019 at 10:35 AM Flavio Pompermaier wrote: > Hi to all,

Dataset statistics

2019-02-08 Thread Flavio Pompermaier
Hi to all, is there any effort to standardize descriptive statistics in Apache Flink? Is there any suggested way to achieve this? Best, Flavio