Re: Support for Percentile and Variance Aggregation functions in Spark with HiveContext

2014-07-25 Thread Michael Armbrust
Hmm, in general we try to support all the UDAFs, but this one must be using a different base class that we don't have a wrapper for. JIRA here: https://issues.apache.org/jira/browse/SPARK-2693

On Fri, Jul 25, 2014 at 8:06 AM, wrote:
> Hi all,
>
> I am using Spark 1.0.0 with CDH 5.1.0.
>
> I

Support for Percentile and Variance Aggregation functions in Spark with HiveContext

2014-07-25 Thread vinay . kashyap
Hi all,

I am using Spark 1.0.0 with CDH 5.1.0. I want to aggregate the data in a raw table using a simple query like the one below:

SELECT MIN(field1), MAX(field2), AVG(field3), PERCENTILE(field4), year, month, day
FROM raw_data_table
GROUP BY year, month, day

MIN, MAX and AVG functions work fine for m
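
For readers who want to sanity-check what the query above is meant to return, here is a minimal sketch in plain Python, outside Spark and Hive entirely. It is not the HiveContext implementation and not a fix for the issue discussed in this thread. The column names come from the query; the helper names `percentile` and `aggregate`, the dict-per-row layout, and the percentile level `p` (the query as posted does not give one) are assumptions made for illustration. The percentile uses linear interpolation between the two nearest ranks, which is one common definition and may differ from what a given engine computes.

```python
from collections import defaultdict
import math


def percentile(values, p):
    """Return the p-th percentile (0 <= p <= 1) of values.

    Uses linear interpolation between the two nearest ranks.
    This definition is an assumption, not taken from the thread.
    """
    if not values:
        raise ValueError("percentile of an empty sequence")
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must be in [0, 1]")
    ordered = sorted(values)
    pos = p * (len(ordered) - 1)  # fractional rank in the sorted data
    lo = math.floor(pos)
    hi = math.ceil(pos)
    if lo == hi:
        return float(ordered[lo])
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


def aggregate(rows, p=0.5):
    """Group rows by (year, month, day) and compute the four aggregates.

    rows is an iterable of dicts keyed by column name (hypothetical layout).
    """
    groups = defaultdict(list)
    for row in rows:
        groups[(row["year"], row["month"], row["day"])].append(row)

    result = {}
    for key, members in groups.items():
        field3 = [r["field3"] for r in members]
        result[key] = {
            "min_field1": min(r["field1"] for r in members),
            "max_field2": max(r["field2"] for r in members),
            "avg_field3": sum(field3) / len(field3),
            "pct_field4": percentile([r["field4"] for r in members], p),
        }
    return result
```

Calling `aggregate(rows, p=0.5)` returns a dict keyed by `(year, month, day)`, which can be compared against the SQL output on a small sample of the table.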