[ 
https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Phabricator updated HIVE-2600:
------------------------------

    Attachment: HIVE-2600.D1005.1.patch

krishnakumar requested code review of "HIVE-2600 [jira] Enable/Add 
type-specific compression for rcfile".
Reviewers: JIRA

  Adding schema-aware compression codec interfaces.

  Enable schema-aware compression codecs which can perform type-specific 
compression on a per-column basis. I see this as a three-part effort:

  1. Add interfaces for the RCFile to communicate column information to the 
codec.

  2. Add an "uber compressor" which can perform column-specific compression on 
a per-block basis. Initially, this can be config-driven, but we can move to a 
dynamic implementation later.

  3. Add a set of type-specific compressors.

  This JIRA covers the first part of the effort.
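
  As a rough illustration of part 1, the sketch below shows one way a writer 
could tell a codec stream which column it is about to compress. It is not 
taken from the attached patch: the names ColumnAwareOutputStream, 
RecordingStream and writeColumns are hypothetical, and the sketch uses only 
java.io rather than the Hadoop compression stream classes so that it runs on 
its own.

```java
import java.io.ByteArrayOutputStream;
import java.io.FilterOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.util.ArrayList;
import java.util.List;

public class SchemaAwareSketch {

    /** Hypothetical stand-in for a schema-aware compression output stream. */
    static abstract class ColumnAwareOutputStream extends FilterOutputStream {
        private int columnIndex = -1; // -1 means no column has been announced yet

        protected ColumnAwareOutputStream(OutputStream out) {
            super(out);
        }

        /** Called by the writer before it hands over a column's bytes. */
        public void setColumnIndex(int columnIndex) {
            this.columnIndex = columnIndex;
        }

        public int getColumnIndex() {
            return columnIndex;
        }
    }

    /** Toy "codec": records the column of each buffer and passes bytes through. */
    static final class RecordingStream extends ColumnAwareOutputStream {
        final List<Integer> seenColumns = new ArrayList<>();

        RecordingStream(OutputStream out) {
            super(out);
        }

        @Override
        public void write(byte[] b, int off, int len) throws IOException {
            // A real codec would pick a type-specific compressor here.
            seenColumns.add(getColumnIndex());
            out.write(b, off, len);
        }
    }

    /** Writer side: announce the column if the stream supports it, then write. */
    static void writeColumns(OutputStream codecStream, byte[][] columnBuffers)
            throws IOException {
        for (int i = 0; i < columnBuffers.length; i++) {
            if (codecStream instanceof ColumnAwareOutputStream) {
                ((ColumnAwareOutputStream) codecStream).setColumnIndex(i);
            }
            codecStream.write(columnBuffers[i], 0, columnBuffers[i].length);
        }
        codecStream.flush();
    }

    public static void main(String[] args) throws IOException {
        ByteArrayOutputStream sink = new ByteArrayOutputStream();
        RecordingStream stream = new RecordingStream(sink);
        byte[][] columns = { {1, 2}, {3}, {4, 5, 6} };
        writeColumns(stream, columns);
        System.out.println(stream.seenColumns); // prints [0, 1, 2]
    }
}
```

  The instanceof check keeps ordinary codecs working unchanged: only streams 
that opt in to the column-aware type receive column information.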

TEST PLAN
  EMPTY

REVISION DETAIL
  https://reviews.facebook.net/D1005

AFFECTED FILES
  ql/src/java/org/apache/hadoop/hive/ql/io/RCFile.java
  ql/src/java/org/apache/hadoop/hive/ql/io/SchemaAwareCompressionInputStream.java
  ql/src/java/org/apache/hadoop/hive/ql/io/SchemaAwareCompressionOutputStream.java

> Enable/Add type-specific compression for rcfile
> -----------------------------------------------
>
>                 Key: HIVE-2600
>                 URL: https://issues.apache.org/jira/browse/HIVE-2600
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Query Processor, Serializers/Deserializers
>            Reporter: Krishna Kumar
>            Assignee: Krishna Kumar
>            Priority: Minor
>         Attachments: HIVE-2600.D1005.1.patch, HIVE-2600.v0.patch, 
> HIVE-2600.v1.patch
>
>
> Enable schema-aware compression codecs which can perform type-specific 
> compression on a per-column basis. I see this as a three-part effort:
> 1. Add interfaces for the RCFile to communicate column information to the 
> codec.
> 2. Add an "uber compressor" which can perform column-specific compression on 
> a per-block basis. Initially, this can be config-driven, but we can move to a 
> dynamic implementation later.
> 3. Add a set of type-specific compressors.
> This JIRA covers the first part of the effort.
