Hi Spark Users,

repartition and partitionBy seems to be very same in Df. 
In which scenario we use one?

As per my understanding repartition is very expensive operation as it needs
full shuffle then when do we use repartition ?

Thanks
Rajat



--
Sent from: http://apache-spark-user-list.1001560.n3.nabble.com/

---------------------------------------------------------------------
To unsubscribe e-mail: [email protected]

Reply via email to