i like the Aggregator a lot (org.apache.spark.sql.expressions.Aggregator),
but i find the way to use it somewhat confusing. I am supposed to simply
call aggregator.toColumn, but that doesn't allow me to specify which fields
it operates on in a DataFrame.

i would basically like to do something like
dataFrame
  .groupBy("k")
  .agg(
    myAggregator.on("v1", "v2").toColumn,
    myOtherAggregator.on("v3", "v4").toColumn
  )

Reply via email to