Re: DataFrame groupBy MapType

Michael Armbrust Mon, 06 Apr 2015 13:54:09 -0700

I'll add that I don't think there is a convenient way to do this in the
Column API ATM, but would welcome a JIRA for adding it :)
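
For reference, here is a minimal sketch of the two approaches discussed in
this thread: the HiveQL query quoted below, and the UDF workaround from the
original question. It is written against the Spark 1.3-era API and assumes
the spark-hive module is on the classpath. The case class (with string keys
rather than the Map[Long, Long] in the question), the table name `a_table`,
the `label` field, and the -1 default for a missing key are all hypothetical
choices made for illustration.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.sql.functions.udf
import org.apache.spark.sql.hive.HiveContext

// Hypothetical stand-in for the case class in the question; string keys are
// assumed so that m['SomeKey'] is a valid lookup.
case class A(m: Map[String, Long], label: String)

object GroupByMapKey {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("GroupByMapKey").setMaster("local[*]")
    val sc = new SparkContext(conf)
    val hiveContext = new HiveContext(sc)
    import hiveContext.implicits._

    val df = sc.parallelize(Seq(
      A(Map("SomeKey" -> 1L), "x"),
      A(Map("SomeKey" -> 1L), "y"),
      A(Map("SomeKey" -> 2L), "z")
    )).toDF()

    // Approach 1: register the DataFrame as a table and group in HiveQL.
    df.registerTempTable("a_table")
    val viaSql = hiveContext.sql(
      "SELECT m['SomeKey'] AS key_value, COUNT(*) AS n " +
      "FROM a_table GROUP BY m['SomeKey']")
    viaSql.show()

    // Approach 2: extract the map value with a UDF, then group on that column.
    // Rows without the key fall into the (assumed) -1 bucket.
    val someKey = udf((m: Map[String, Long]) => m.getOrElse("SomeKey", -1L))
    val viaUdf = df.groupBy(someKey(df("m"))).count()
    viaUdf.show()

    sc.stop()
  }
}
```

Both variants should produce one row per distinct value stored under
'SomeKey'. The SQL route keeps the lookup visible to the query planner,
while the UDF route stays inside the DataFrame API.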


On Mon, Apr 6, 2015 at 1:45 PM, Michael Armbrust <mich...@databricks.com>
wrote:

> In HiveQL, you should be able to express this as:
>
> SELECT ... FROM table GROUP BY m['SomeKey']
>
> On Sat, Apr 4, 2015 at 5:25 PM, Justin Yip <yipjus...@prediction.io>
> wrote:
>
>> Hello,
>>
>> I have a case class like this:
>>
>> case class A(
>>   m: Map[Long, Long],
>>   ...
>> )
>>
>> and constructed a DataFrame from Seq[A].
>>
>> I would like to perform a groupBy on A.m("SomeKey"). I can implement a
>> UDF, create a new Column, and then invoke groupBy on the new Column. But is
>> that the idiomatic way to do such an operation?
>>
>> I can't find much in the docs about operating on MapType columns.
>>
>> Thanks in advance!
>>
>> Justin
>>
>
>
