Todd Lisonbee created FLINK-3664: ------------------------------------ Summary: Create a method to easily Summarize a DataSet Key: FLINK-3664 URL: https://issues.apache.org/jira/browse/FLINK-3664 Project: Flink Issue Type: Improvement Reporter: Todd Lisonbee
Here is an example: /** * Summarize a DataSet of Tuples by collecting single pass statistics for all columns */ public Tuple summarize() Dataset<Tuple3<Double, String, Boolean>> input = // [...] Tuple3<DoubleColumnSummary,StringColumnSummary,BooleanColumnSummary> summary = input.summarize() summary.getField(0).stddev() summary.getField(1).maxStringLength() -- This message was sent by Atlassian JIRA (v6.3.4#6332)