[ https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Phabricator updated HIVE-2600: ------------------------------ Attachment: HIVE-2600.D1005.1.patch krishnakumar requested code review of "HIVE-2600 [jira] Enable/Add type-specific compression for rcfile". Reviewers: JIRA adding schema-aware compression codec interfaces Enable schema-aware compression codecs which can perform type-specific compression on a per-column basis. I see this as in three-parts 1. Add interfaces for the rcfile to communicate column information to the codec 2. Add an "uber compressor" which can perform column-specific compression on a per-block basis. Initially, this can be config driven, but we can go for a dynamic implementation later. 3. A bunch of type-specific compressors This jira is for the first part of the effort. TEST PLAN EMPTY REVISION DETAIL https://reviews.facebook.net/D1005 AFFECTED FILES ql/src/java/org/apache/hadoop/hive/ql/io/RCFile.java ql/src/java/org/apache/hadoop/hive/ql/io/SchemaAwareCompressionInputStream.java ql/src/java/org/apache/hadoop/hive/ql/io/SchemaAwareCompressionOutputStream.java MANAGE HERALD DIFFERENTIAL RULES https://reviews.facebook.net/herald/view/differential/ WHY DID I GET THIS EMAIL? https://reviews.facebook.net/herald/transcript/2103/ Tip: use the X-Herald-Rules header to filter Herald messages in your client. > Enable/Add type-specific compression for rcfile > ----------------------------------------------- > > Key: HIVE-2600 > URL: https://issues.apache.org/jira/browse/HIVE-2600 > Project: Hive > Issue Type: Sub-task > Components: Query Processor, Serializers/Deserializers > Reporter: Krishna Kumar > Assignee: Krishna Kumar > Priority: Minor > Attachments: HIVE-2600.D1005.1.patch, HIVE-2600.v0.patch, > HIVE-2600.v1.patch > > > Enable schema-aware compression codecs which can perform type-specific > compression on a per-column basis. I see this as in three-parts > 1. Add interfaces for the rcfile to communicate column information to the > codec > 2. Add an "uber compressor" which can perform column-specific compression on > a per-block basis. Initially, this can be config driven, but we can go for a > dynamic implementation later. > 3. A bunch of type-specific compressors > This jira is for the first part of the effort. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira