I agreed that to make sure this work, you might need to know the Spark internal implementation for APIs such as `groupBy`.
But without any more changes to current Spark implementation, I think this is the one possible way to achieve the required function to aggregate on sorted data per key. ----- Liang-Chi Hsieh | @viirya Spark Technology Center http://www.spark.tc/ -- View this message in context: http://apache-spark-developers-list.1001551.n3.nabble.com/Aggregating-over-sorted-data-tp19999p20331.html Sent from the Apache Spark Developers List mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe e-mail: dev-unsubscr...@spark.apache.org