Ádám Szita created HIVE-25948: --------------------------------- Summary: Optimize Iceberg writes by directing records either Clustered- or Fanoutwriter Key: HIVE-25948 URL: https://issues.apache.org/jira/browse/HIVE-25948 Project: Hive Issue Type: Improvement Reporter: Ádám Szita Assignee: Ádám Szita
Currently Hive writes Iceberg tables with ClusteredWriter. This has less memory footprint as it only keeps one writer open at a time, but requires the records to be sorted. However if data cardinality is low Fanoutwriter is a better choice for performance. We should add support so that either can be used, and the decision could be based similarly how currently SortedDynPartitonOptimizer has it. -- This message was sent by Atlassian Jira (v8.20.1#820001)