Re: repartition in df vs partitionBy in df

rajat kumar Wed, 24 Apr 2019 22:11:15 -0700

hello,
thanks for quick reply.
got it . partitionBy is to create something like hive partitions.
but when do we use repartition actually?
how to decide whether to do repartition or not?
because in development we are getting sample data.
also what number should I give while repartition.


thanks

On Thu, 25 Apr 2019, 10:31 moqi <moqimoqi...@gmail.com wrote:

> Hello, I think you can refer to this link and hope to help you.
>
>
> https://stackoverflow.com/questions/40416357/spark-sql-difference-between-df-repartition-and-dataframewriter-partitionby/40417992
>
>
>
>
>
> --
> Sent from: http://apache-spark-user-list.1001560.n3.nabble.com/
>
> ---------------------------------------------------------------------
> To unsubscribe e-mail: user-unsubscr...@spark.apache.org
>
>

Re: repartition in df vs partitionBy in df

Reply via email to