[
https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13155228#comment-13155228
]
Krishna Kumar commented on HIVE-2600:
-------------------------------------
This should be usable for any rcfile data. I am looking only at end-user data
storage optimizations. I will post the code for the uber codec/serde in a
day or two; it will enable column-specific selection of compression
mechanisms.
> Enable/Add type-specific compression for rcfile
> -----------------------------------------------
>
> Key: HIVE-2600
> URL: https://issues.apache.org/jira/browse/HIVE-2600
> Project: Hive
> Issue Type: Sub-task
> Components: Query Processor, Serializers/Deserializers
> Reporter: Krishna Kumar
> Assignee: Krishna Kumar
> Priority: Minor
> Attachments: HIVE-2600.v0.patch
>
>
> Enable schema-aware compression codecs which can perform type-specific
> compression on a per-column basis. I see this as three parts:
> 1. Add interfaces for the rcfile to communicate column information to the
> codec
> 2. Add an "uber compressor" which can perform column-specific compression on
> a per-block basis. Initially, this can be config driven, but we can go for a
> dynamic implementation later.
> 3. A bunch of type-specific compressors
> This jira is for the first part of the effort.
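
To make the three parts above concrete, here is a minimal sketch of how they might fit together. It is not the code from HIVE-2600.v0.patch: every name in it (SchemaAwareCodec, ColumnCompressor, UberCodec, and the type strings) is hypothetical, and it uses only java.util.zip rather than the Hadoop codec classes. The rcfile side would call setColumnTypes once, then hand each column's bytes to the codec along with the column index.

```java
import java.io.ByteArrayOutputStream;
import java.util.HashMap;
import java.util.Map;
import java.util.zip.DataFormatException;
import java.util.zip.Deflater;
import java.util.zip.Inflater;

public class ColumnAwareCodecSketch {

    /** Part 1 (hypothetical): lets the file writer pass column information to a codec. */
    public interface SchemaAwareCodec {
        void setColumnTypes(String[] columnTypes);
        byte[] compress(int columnIndex, byte[] raw);
        byte[] decompress(int columnIndex, byte[] compressed);
    }

    /** Part 3 (hypothetical): compresses the bytes of a single column. */
    public interface ColumnCompressor {
        byte[] compress(byte[] raw);
        byte[] decompress(byte[] compressed);
    }

    /** General-purpose compressor backed by java.util.zip. */
    public static final class DeflateCompressor implements ColumnCompressor {
        public byte[] compress(byte[] raw) {
            Deflater deflater = new Deflater();
            deflater.setInput(raw);
            deflater.finish();
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            byte[] buf = new byte[4096];
            while (!deflater.finished()) {
                out.write(buf, 0, deflater.deflate(buf));
            }
            deflater.end();
            return out.toByteArray();
        }

        public byte[] decompress(byte[] compressed) {
            Inflater inflater = new Inflater();
            inflater.setInput(compressed);
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            byte[] buf = new byte[4096];
            try {
                while (!inflater.finished()) {
                    int n = inflater.inflate(buf);
                    if (n == 0 && (inflater.needsInput() || inflater.needsDictionary())) {
                        throw new IllegalStateException("truncated or unsupported input");
                    }
                    out.write(buf, 0, n);
                }
            } catch (DataFormatException e) {
                throw new IllegalStateException("corrupt column data", e);
            } finally {
                inflater.end();
            }
            return out.toByteArray();
        }
    }

    /** Stores the bytes unchanged; stands in for "no compression configured". */
    public static final class PassThroughCompressor implements ColumnCompressor {
        public byte[] compress(byte[] raw) { return raw.clone(); }
        public byte[] decompress(byte[] compressed) { return compressed.clone(); }
    }

    /** Part 2 (hypothetical): picks a compressor per column from a type -> compressor map. */
    public static final class UberCodec implements SchemaAwareCodec {
        private final Map<String, ColumnCompressor> byType;
        private final ColumnCompressor fallback;
        private String[] columnTypes = new String[0];

        public UberCodec(Map<String, ColumnCompressor> byType, ColumnCompressor fallback) {
            this.byType = new HashMap<>(byType);
            this.fallback = fallback;
        }

        public void setColumnTypes(String[] columnTypes) {
            this.columnTypes = columnTypes.clone();
        }

        private ColumnCompressor pick(int columnIndex) {
            if (columnIndex < 0 || columnIndex >= columnTypes.length) {
                return fallback; // unknown column: use the default
            }
            return byType.getOrDefault(columnTypes[columnIndex], fallback);
        }

        public byte[] compress(int columnIndex, byte[] raw) {
            return pick(columnIndex).compress(raw);
        }

        public byte[] decompress(int columnIndex, byte[] compressed) {
            return pick(columnIndex).decompress(compressed);
        }
    }
}
```

In this sketch the type-to-compressor map plays the role of the config-driven selection; a dynamic implementation could instead choose a compressor per block by trying several and keeping the smallest output.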