can you give an example set of data and desired output?

On Sat, Apr 25, 2015 at 2:32 PM, Wenlei Xie <wenlei....@gmail.com> wrote:
> Hi,
>
> I would like to answer the following customized aggregation query on Spark
> SQL:
> 1. Group the table by the value of Name.
> 2. For each group, choose the tuple with the max value of Age (the ages
> are distinct for every name).
>
> I am wondering what's the best way to do it on Spark SQL? Should I use a
> UDAF? Previously I was doing something like the following on Spark:
>
> personRDD.map(t => (t.name, t))
>   .reduceByKey((a, b) => if (a.age > b.age) a else b)
>
> Thank you!
>
> Best,
> Wenlei

--
Best Regards,
Ayan Guha
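
For reference, below is a minimal, self-contained sketch of both the reduceByKey approach quoted above and one possible Spark SQL equivalent (group by name to find the max age, then join back to recover the full tuple). The Person case class, the "people" table name, and the sample rows are assumptions made for illustration; they are not from the original thread.

    import org.apache.spark.{SparkConf, SparkContext}
    import org.apache.spark.sql.SQLContext

    case class Person(name: String, age: Int)

    object MaxAgePerName {
      def main(args: Array[String]): Unit = {
        val sc = new SparkContext(
          new SparkConf().setMaster("local[*]").setAppName("max-age-per-name"))
        val sqlContext = new SQLContext(sc)
        import sqlContext.implicits._

        // Hypothetical sample data standing in for the real table.
        val personRDD = sc.parallelize(Seq(
          Person("alice", 30), Person("alice", 25), Person("bob", 40)))

        // RDD approach from the quoted mail: keep the oldest tuple per name.
        val viaRdd = personRDD
          .map(t => (t.name, t))
          .reduceByKey((a, b) => if (a.age > b.age) a else b)
          .values

        // One possible Spark SQL equivalent: find the max age per name, then
        // join back to the full rows (ages are distinct within each name).
        personRDD.toDF().registerTempTable("people")
        val viaSql = sqlContext.sql(
          """SELECT p.name, p.age
            |FROM people p
            |JOIN (SELECT name, MAX(age) AS max_age FROM people GROUP BY name) m
            |  ON p.name = m.name AND p.age = m.max_age""".stripMargin)

        viaRdd.collect().foreach(println)
        viaSql.show()
      }
    }

Both versions should return one row per name, carrying the full tuple with the maximum age, so the choice mostly comes down to whether the rest of the pipeline is RDD-based or DataFrame/SQL-based.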