Hi Spark Users,

repartition and partitionBy seem to be very similar in the DataFrame API.
In which scenario do we use each one?

As per my understanding, repartition is a very expensive operation because it
needs a full shuffle. So when do we use repartition?

Thanks
Rajat



