[jira] [Updated] (HIVE-28651) Make Hadoop Vectored IO work in Apache Hive

Butao Zhang (Jira) Thu, 28 Nov 2024 19:13:04 -0800


     [ 
https://issues.apache.org/jira/browse/HIVE-28651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Butao Zhang updated HIVE-28651:
-------------------------------
    Description: 
Hadoop3.3.5+ （HADOOP-18103）added the Hadoop Vectored IO feature which can be 
high-performance against cloud storage.

Apache ORC ORC-1251 and Apache Parquet PARQUET-2171 have integrated this great 
feature, so we can try make it work in Apache Hive.

We need to upgrade ORC&Parquet first, then try to test if the Vectored IO can 
work.
 # HIVE-28650 Upgrade Apache ORC version to 2.0.3
 # HIVE-28625 Upgrade Apache Parquet version to 1.14.4

  was:
Hadoop3.3.5+ （HADOOP-18103）added the Hadoop Vectored IO feature which can be 
high-performance against cloud storage.

Apache ORC ORC-1251 and Apache Parquet PARQUET-2171 have integrated this greate 
feature, so we can try make it work in Apache Hive.

We need to upgrade ORC&Parquet first, then try to test if the Vectored IO can 
work.
 # HIVE-28650 Upgrade Apache ORC version to 2.0.3
 # HIVE-28625 Upgrade Apache Parquet version to 1.14.4


> Make Hadoop Vectored IO work in Apache Hive
> -------------------------------------------
>
>                 Key: HIVE-28651
>                 URL: https://issues.apache.org/jira/browse/HIVE-28651
>             Project: Hive
>          Issue Type: Improvement
>      Security Level: Public(Viewable by anyone) 
>            Reporter: Butao Zhang
>            Priority: Major
>
> Hadoop3.3.5+ （HADOOP-18103）added the Hadoop Vectored IO feature which can be 
> high-performance against cloud storage.
> Apache ORC ORC-1251 and Apache Parquet PARQUET-2171 have integrated this 
> great feature, so we can try make it work in Apache Hive.
> We need to upgrade ORC&Parquet first, then try to test if the Vectored IO can 
> work.
>  # HIVE-28650 Upgrade Apache ORC version to 2.0.3
>  # HIVE-28625 Upgrade Apache Parquet version to 1.14.4



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Updated] (HIVE-28651) Make Hadoop Vectored IO work in Apache Hive

Reply via email to