Hi Pol, I had considered repartitioning but the main issue for me there is that it will trigger a shuffle and could significantly slow down the query/application as a result. Thanks for contributing that as an alternative suggestion though :)
-- Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/ --------------------------------------------------------------------- To unsubscribe e-mail: dev-unsubscr...@spark.apache.org