Filter on Grouped Data

Megha Sridhar- Cynepia Fri, 03 Jul 2015 01:00:08 -0700

Hi,


I have a Spark DataFrame object, which when trimmed, looks like,

From            To                  Subject        Message-ID

[email protected] ['[email protected]', SEC Inquiry<19952575.1075858>

             '[email protected]',
             '[email protected]',
             '[email protected]']

[email protected] ['[email protected]', Reviseddocuments <33499184.1075858>

             '[email protected]',
             '[email protected]']
.
.
.

I have run a groupBy("From") on the above dataFrame and obtained aGroupedData object as a result. I need to apply a filter on the groupeddata (for instance, getting the sender who sent maximum number of themails that were addressed to a particular receiver in the "To" list).

Is there a way to accomplish this by applying filter on grouped data?


Thanks,
Megha


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Filter on Grouped Data

Reply via email to