Hi,
Including query plan :
DataFrame :
== Physical Plan ==
SortBasedAggregate(key=[agreement#23],
functions=[(MaxVectorAggFunction(values#3),mode=Final,isDistinct=false)],
output=[agreement#23,maxvalues#27])
+- ConvertToSafe
+- Sort [agreement#23 ASC], false, 0
+- TungstenExchange hashpa
The focus of this release was to get the API out there and there's a lot of
low hanging performance optimizations. That said, there is likely always
going to be some cost of materializing objects.
Another note, anytime your comparing performance its useful to include the
output of explain so we c