[jira] [Commented] (HIVE-5783) Native Parquet Support in Hive

Carl Steinbach (JIRA) Fri, 06 Dec 2013 15:12:25 -0800

    [ 
https://issues.apache.org/jira/browse/HIVE-5783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13841855#comment-13841855
 ]


Carl Steinbach commented on HIVE-5783:
--------------------------------------

[~jcoffey] Would you and your coworkers be willing to consider the option of 
committing the SerDe code directly to Hive instead of having Hive depend on a 
third-party JAR? I appreciate that this will make it a little less convenient 
for you to push in changes. However, I think there are two big drawbacks to the 
third-party JAR approach: 1) existing Hive contributors will be much less 
likely contribute improvements to this code since it lives in a different 
repository, and 2) Hive won't be able to benefit from parquet-serde 
improvements until they appear in a new parquet-serde release.

> Native Parquet Support in Hive
> ------------------------------
>
>                 Key: HIVE-5783
>                 URL: https://issues.apache.org/jira/browse/HIVE-5783
>             Project: Hive
>          Issue Type: New Feature
>            Reporter: Justin Coffey
>            Assignee: Justin Coffey
>            Priority: Minor
>             Fix For: 0.11.0
>
>         Attachments: HIVE-5783.patch, hive-0.11-parquet.patch
>
>
> Problem Statement:
> Hive would be easier to use if it had native Parquet support. Our 
> organization, Criteo, uses Hive extensively. Therefore we built the Parquet 
> Hive integration and would like to now contribute that integration to Hive.
> About Parquet:
> Parquet is a columnar storage format for Hadoop and integrates with many 
> Hadoop ecosystem tools such as Thrift, Avro, Hadoop MapReduce, Cascading, 
> Pig, Drill, Crunch, and Hive. Pig, Crunch, and Drill all contain native 
> Parquet integration.
> Changes Details:
> Parquet was built with dependency management in mind and therefore only a 
> single Parquet jar will be added as a dependency.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5783) Native Parquet Support in Hive

Reply via email to