Spark iceberg query

Gaurav Agarwal Wed, 31 May 2023 23:19:36 -0700

Hi

We are loading the data in spark dataset
We don't do any repartitioning on the loaded dataset


If we want to load the in the iceberg partitioned table . If we do
dataset.sortwithinpartition() before doing append or merge or overwrite.

Will that be the best practice shall I go ahead?

Or if I go with dataset.repartition on the partition column and then do the
dataset.sortwithinpartitions() will that be the good practice.

Kindly let me know

Spark iceberg query

Reply via email to