Ádám Szita created HIVE-25948:
---------------------------------

             Summary: Optimize Iceberg writes by directing records to either 
ClusteredWriter or FanoutWriter
                 Key: HIVE-25948
                 URL: https://issues.apache.org/jira/browse/HIVE-25948
             Project: Hive
          Issue Type: Improvement
            Reporter: Ádám Szita
            Assignee: Ádám Szita


Currently Hive writes Iceberg tables with ClusteredWriter. This has a smaller 
memory footprint because it keeps only one writer open at a time, but it 
requires the incoming records to be sorted.

However, if data cardinality is low (few distinct partition values), 
FanoutWriter is a better choice for performance, since it avoids the sort.

We should add support so that either writer can be used, and the decision 
could be made similarly to how SortedDynPartitionOptimizer currently makes it.
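
To illustrate the trade-off described above, here is a minimal Java sketch. It 
is not Hive or Iceberg code: the class WriterChoiceSketch, the chooseWriter 
rule, and the maxOpenWriters parameter are hypothetical names invented for 
this example, and the two "writers" only count how many writers would be open 
at once. It assumes that a clustered writer rejects a partition key that 
reappears after its writer was closed, and that a fanout writer keeps one 
writer open per distinct key.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class WriterChoiceSketch {

    enum WriterKind { CLUSTERED, FANOUT }

    // Hypothetical decision rule: use fanout only when the estimated number
    // of partitions fits within a budget of simultaneously open writers.
    static WriterKind chooseWriter(long estimatedPartitions, long maxOpenWriters) {
        return estimatedPartitions <= maxOpenWriters
                ? WriterKind.FANOUT
                : WriterKind.CLUSTERED;
    }

    // Toy clustered writer: one open writer at a time. Input must be grouped
    // by partition key; a key that returns after being closed is an error.
    static int clusteredPeakOpenWriters(List<String> partitionKeys) {
        Set<String> closed = new HashSet<>();
        String current = null;
        for (String key : partitionKeys) {
            if (!key.equals(current)) {
                if (closed.contains(key)) {
                    throw new IllegalStateException(
                            "records are not clustered by partition: " + key);
                }
                if (current != null) {
                    closed.add(current);
                }
                current = key;
            }
        }
        return partitionKeys.isEmpty() ? 0 : 1;
    }

    // Toy fanout writer: accepts any order, but keeps one writer open for
    // every distinct partition key seen so far.
    static int fanoutPeakOpenWriters(List<String> partitionKeys) {
        Set<String> open = new HashSet<>(partitionKeys);
        return open.size();
    }

    public static void main(String[] args) {
        List<String> unsorted = Arrays.asList("a", "b", "a");
        System.out.println("fanout open writers: " + fanoutPeakOpenWriters(unsorted));
        try {
            clusteredPeakOpenWriters(unsorted);
        } catch (IllegalStateException e) {
            System.out.println("clustered rejected unsorted input");
        }
        System.out.println("choice for 3 partitions, budget 10: " + chooseWriter(3, 10));
    }
}
```

In this sketch the fanout path needs no sort but its memory use grows with the 
number of distinct partition keys, which is why a cardinality-based threshold 
is a plausible basis for the decision.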



--
This message was sent by Atlassian Jira
(v8.20.1#820001)