For a dataset as small as this one you could probably reduce the number of shuffle partitions. This will be possible once https://github.com/apache/spark/pull/956 is merged.
On Thu, Jun 5, 2014 at 11:31 AM, ssb61 <santoshbalma...@gmail.com> wrote: > Any inputs to reduce the time duration for mapPartitions at > Exchange.scala:44 > from 13 s? > > > > -- > View this message in context: > http://apache-spark-user-list.1001560.n3.nabble.com/SQLContext-and-HiveContext-Query-Performance-tp6948p7075.html > Sent from the Apache Spark User List mailing list archive at Nabble.com. >