Hi all,
Can somebody put some light on this pls?
Thanks,
Aakash.
-- Forwarded message --
From: "Aakash Basu"
Date: 15-Jun-2017 2:57 PM
Subject: Repartition vs PartitionBy Help/Understanding needed
To: "user"
Cc:
Hi all,
>
> Everybody is giving a diffe
Hi all,
Everybody is giving a difference between coalesce and repartition, but
nowhere I found a difference between partitionBy and repartition. My
question is, is it better to write a data set in parquet partitioning by a
column and then reading the respective directories to work on that column
i