Elliot West created HIVE-19457:
----------------------------------

             Summary: Support Iceberg as a table format
                 Key: HIVE-19457
                 URL: https://issues.apache.org/jira/browse/HIVE-19457
             Project: Hive
          Issue Type: New Feature
          Components: Metastore, StorageHandler
            Reporter: Elliot West


From [Netflix/iceberg|https://github.com/Netflix/iceberg]:
{quote}
Iceberg is a new table format for storing large, slow-moving tabular data. It 
is designed to improve on the de-facto standard table layout built into Hive, 
Presto, and Spark.
{quote}

Iceberg elegantly solves many data and metadata challenges when handling large 
datasets, especially when using eventually consistent file stores. By 
integrating Iceberg with Hive we could hope to gain the following:

* Improved performance when reading large numbers of partitions (HMS is no 
longer a bottleneck).
* A consistent view for users reading changing datasets (snapshot 
isolation).
* Strong consistency on eventually consistent file stores without the need for 
additional infrastructure (data is immutable and is only ever written with 
{{PUT}}; eventually consistent operations such as {{LIST}} are avoided).
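
To illustrate the last two points, here is a minimal toy sketch of the idea, not Iceberg's actual API or file layout. The names {{Snapshot}}, {{ToyTable}}, {{append}} and {{scan}} are hypothetical, and a Python dict stands in for the object store. It assumes that each commit publishes a new immutable list of data files, so a reader that pins a snapshot keeps a stable view and never needs to list a directory.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Snapshot:
    """Immutable list of the data files that make up one table version."""
    snapshot_id: int
    files: Tuple[str, ...]


@dataclass
class ToyTable:
    """Hypothetical in-memory stand-in for a table backed by an object store."""
    # Write-once objects, keyed by path (stands in for the file store).
    store: Dict[str, bytes] = field(default_factory=dict)
    # Pointer to the latest committed snapshot.
    current: Snapshot = field(default_factory=lambda: Snapshot(0, ()))

    def append(self, path: str, data: bytes) -> Snapshot:
        if path in self.store:
            raise ValueError(f"{path} already exists; files are never overwritten")
        self.store[path] = data  # a single PUT of a brand-new object
        # Commit by pointing at a new snapshot; older snapshots are untouched.
        self.current = Snapshot(self.current.snapshot_id + 1,
                                self.current.files + (path,))
        return self.current

    def scan(self, snapshot: Snapshot) -> List[bytes]:
        # File paths come from the snapshot metadata, so no LIST is issued.
        return [self.store[path] for path in snapshot.files]


table = ToyTable()
table.append("data/a.parquet", b"a")
pinned = table.current                # a reader pins this version
table.append("data/b.parquet", b"b")  # a concurrent writer commits

print(table.scan(pinned))         # the pinned reader still sees one file
print(table.scan(table.current))  # a new reader sees both files
```

In a real integration the commit step would need to be an atomic swap of the table's metadata pointer; the sketch glosses over that and over concurrent writers.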



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
