Hi

We are loading data into a Spark Dataset.
We don't do any repartitioning on the loaded Dataset.

We want to load it into a partitioned Iceberg table. Suppose we call
dataset.sortWithinPartitions() before doing the append, merge, or overwrite.

Would that be the best practice, and should I go ahead with it?

Or, if I first call dataset.repartition() on the partition column and then
dataset.sortWithinPartitions(), would that be good practice?
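
To make the difference between the two options concrete, here is a toy model
in plain Python. It is not Spark or Iceberg code, only a sketch. It assumes
that each write task produces one data file per table partition value it
sees, and that the loaded rows are spread round-robin over 4 tasks. The
function names, the `days` values, and the row layout are all made up for
illustration.

```python
import zlib

def files_written(task_rows):
    # Assumption: one output file per (task, partition value) pair.
    return sum(len({part for part, _ in rows}) for rows in task_rows)

def sort_within_tasks(task_rows):
    # Models sortWithinPartitions(): each task sorts its own rows, no shuffle.
    return [sorted(rows) for rows in task_rows]

def repartition_by_value(task_rows, num_tasks):
    # Models repartition(partitionColumn): rows with the same partition
    # value are shuffled to the same task.
    out = [[] for _ in range(num_tasks)]
    for rows in task_rows:
        for part, payload in rows:
            target = zlib.crc32(part.encode()) % num_tasks
            out[target].append((part, payload))
    return out

# Hypothetical data: 24 rows over 3 partition values, spread over 4 tasks.
days = ["2024-01-01", "2024-01-02", "2024-01-03"]
rows = [(days[i % 3], i) for i in range(24)]
tasks = [rows[t::4] for t in range(4)]

# Option 1: sort within partitions only.
option_1 = files_written(sort_within_tasks(tasks))

# Option 2: repartition on the partition column, then sort within partitions.
option_2 = files_written(sort_within_tasks(repartition_by_value(tasks, 4)))

print(option_1, option_2)  # 12 3
```

In this model the sort alone does not change how many files are written,
since every task still holds rows for every partition value. Real file
counts also depend on file size limits and the table's write settings, which
the model ignores.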

Kindly let me know.
