Hi,
I have a Spark DataFrame object, which when trimmed, looks like,
From To Subject Message-ID
[email protected] ['[email protected]', SEC Inquiry
<19952575.1075858>
'[email protected]',
'[email protected]',
'[email protected]']
[email protected] ['[email protected]', Revised
documents <33499184.1075858>
'[email protected]',
'[email protected]']
.
.
.
I have run a groupBy("From") on the above dataFrame and obtained a
GroupedData object as a result. I need to apply a filter on the grouped
data (for instance, getting the sender who sent maximum number of the
mails that were addressed to a particular receiver in the "To" list).
Is there a way to accomplish this by applying filter on grouped data?
Thanks,
Megha
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]