[
https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2623:
--------------------------------
Attachment: steppedpowerlaw.gz
Synthetic data for elias gamma.
Storage size when stored as rc files (single field record):
plain: 188320
gzip: 61280
bzip2: 44023
uber+eliasgamma: 35963
Actual gains in production will depend upon actual probability distributions +
size of other columns.
> Add Integer type compressors
> ----------------------------
>
> Key: HIVE-2623
> URL: https://issues.apache.org/jira/browse/HIVE-2623
> Project: Hive
> Issue Type: Sub-task
> Components: Query Processor, Serializers/Deserializers
> Reporter: Krishna Kumar
> Assignee: Krishna Kumar
> Priority: Minor
> Attachments: HIVE-2623.v0.patch, steppedpowerlaw.gz
>
>
> Type-specific compressors for integers.
> Starting with elias gamma which prefers small values as per a power-law like
> distribution.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira