[jira] [Updated] (HIVE-2417) Merging of compressed rcfiles fails to write the valuebuffer part correctly

2011-08-29 Thread Krishna Kumar (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2417: Status: Patch Available (was: Open) > Merging of compressed rcfiles fails to write the valuebuf

[jira] [Updated] (HIVE-2413) BlockMergeTask ignores client-specified jars

2011-08-29 Thread Krishna Kumar (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2413: Assignee: Krishna Kumar Status: Patch Available (was: Open) > BlockMergeTask ignores cli

[jira] [Updated] (HIVE-2097) Explore mechanisms for better compression with RC Files

2012-05-14 Thread Krishna Kumar (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2097: Attachment: datacomp.tar.gz unrefactored source for all the implemented compression codecs

[jira] [Commented] (HIVE-2623) Add Integer type compressors

2012-08-20 Thread Krishna Kumar (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438434#comment-13438434 ] Krishna Kumar commented on HIVE-2623: - Hi Rao, I have not been able to do

[jira] [Created] (HIVE-2600) Enable/Add type-specific compression for rcfile

2011-11-20 Thread Krishna Kumar (Created) (JIRA)
, Serializers/Deserializers Reporter: Krishna Kumar Assignee: Krishna Kumar Priority: Minor Enable schema-aware compression codecs which can perform type-specific compression on a per-column basis. I see this as in three-parts 1. Add interfaces for the rcfile to

[jira] [Updated] (HIVE-2600) Enable/Add type-specific compression for rcfile

2011-11-20 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2600: Description: Enable schema-aware compression codecs which can perform type-specific compression on

[jira] [Updated] (HIVE-2600) Enable/Add type-specific compression for rcfile

2011-11-20 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2600: Attachment: HIVE-2600.v0.patch Added two interfaces and invocations at appropriate places

[jira] [Updated] (HIVE-2600) Enable/Add type-specific compression for rcfile

2011-11-20 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2600: Status: Patch Available (was: Open) > Enable/Add type-specific compression for rcf

[jira] [Commented] (HIVE-2600) Enable/Add type-specific compression for rcfile

2011-11-22 Thread Krishna Kumar (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13155228#comment-13155228 ] Krishna Kumar commented on HIVE-2600: - This should be usable for any rcfile data.

[jira] [Created] (HIVE-2604) Add UberCompressor Serde/Codec to contrib which allows per-column compression strategies

2011-11-23 Thread Krishna Kumar (Created) (JIRA)
Project: Hive Issue Type: Sub-task Reporter: Krishna Kumar The strategies supported are 1. using a specified codec on the column 2. using a specific codec on the column which is serialized via a specific serde 3. using a specific "TypeSpecificCompressor" instanc

[jira] [Updated] (HIVE-2604) Add UberCompressor Serde/Codec to contrib which allows per-column compression strategies

2011-11-23 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2604: Attachment: HIVE-2604.v0.patch initial version of the patch. > Add UberCompres

[jira] [Commented] (HIVE-2600) Enable/Add type-specific compression for rcfile

2011-11-23 Thread Krishna Kumar (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13156006#comment-13156006 ] Krishna Kumar commented on HIVE-2600: - Added the code for uber codec/serde to

[jira] [Assigned] (HIVE-2604) Add UberCompressor Serde/Codec to contrib which allows per-column compression strategies

2011-11-23 Thread Krishna Kumar (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar reassigned HIVE-2604: --- Assignee: Krishna Kumar > Add UberCompressor Serde/Codec to contrib which allows

[jira] [Commented] (HIVE-2604) Add UberCompressor Serde/Codec to contrib which allows per-column compression strategies

2011-11-24 Thread Krishna Kumar (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13156758#comment-13156758 ] Krishna Kumar commented on HIVE-2604: - Sure. I'll add some of the compressors

[jira] [Commented] (HIVE-2097) Explore mechanisms for better compression with RC Files

2011-11-28 Thread Krishna Kumar (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13158484#comment-13158484 ] Krishna Kumar commented on HIVE-2097: - Thanks Alex for the suggestions. Just t

[jira] [Updated] (HIVE-2600) Enable/Add type-specific compression for rcfile

2011-11-28 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2600: Attachment: HIVE-2600.v1.patch Added logging to report every column's total le

[jira] [Commented] (HIVE-2600) Enable/Add type-specific compression for rcfile

2011-11-28 Thread Krishna Kumar (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13159122#comment-13159122 ] Krishna Kumar commented on HIVE-2600: - I have chosen to record that information in

[jira] [Commented] (HIVE-2600) Enable/Add type-specific compression for rcfile

2011-11-29 Thread Krishna Kumar (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13159734#comment-13159734 ] Krishna Kumar commented on HIVE-2600: - That is part of the compression codec

[jira] [Created] (HIVE-2623) Add Integer type compressors

2011-12-05 Thread Krishna Kumar (Created) (JIRA)
Add Integer type compressors Key: HIVE-2623 URL: https://issues.apache.org/jira/browse/HIVE-2623 Project: Hive Issue Type: Sub-task Reporter: Krishna Kumar Assignee: Krishna Kumar

[jira] [Updated] (HIVE-2623) Add Integer type compressors

2011-12-05 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2623: Attachment: HIVE-2623.v0.patch Elias gamma for a two-sided distribution. Draft only - tests and one

[jira] [Updated] (HIVE-2623) Add Integer type compressors

2011-12-05 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2623: Attachment: steppedpowerlaw.gz Synthetic data for elias gamma. Storage size when stored as rc

[jira] [Commented] (HIVE-2600) Enable/Add type-specific compression for rcfile

2011-12-05 Thread Krishna Kumar (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13162705#comment-13162705 ] Krishna Kumar commented on HIVE-2600: - He Yongqiang, In UberCompressor, I have

[jira] [Commented] (HIVE-2604) Add UberCompressor Serde/Codec to contrib which allows per-column compression strategies

2011-12-05 Thread Krishna Kumar (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13162709#comment-13162709 ] Krishna Kumar commented on HIVE-2604: - The current implementation works as fol

[jira] [Updated] (HIVE-2623) Add Integer type compressors

2011-12-06 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2623: Attachment: (was: steppedpowerlaw.gz) > Add Integer type compress

[jira] [Updated] (HIVE-2623) Add Integer type compressors

2011-12-06 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2623: Attachment: steppedpowerlawTimestamp.gz steppedpowerlawIQ.gz

[jira] [Updated] (HIVE-2623) Add Integer type compressors

2011-12-06 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2623: Attachment: HIVE-2623.v1.patch Added 3 three more elias gamma compressors - so now supporting one

[jira] [Commented] (HIVE-2623) Add Integer type compressors

2011-12-06 Thread Krishna Kumar (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13163437#comment-13163437 ] Krishna Kumar commented on HIVE-2623: - RCFile size stats: gzip/bzip2/

[jira] [Updated] (HIVE-2604) Add UberCompressor Serde/Codec to contrib which allows per-column compression strategies

2011-12-08 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2604: Attachment: HIVE-2604.v1.patch Serde wraps lazysimpleserde to make it more similar to columnarserde

[jira] [Updated] (HIVE-2604) Add UberCompressor Serde/Codec to contrib which allows per-column compression strategies

2011-12-08 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2604: Status: Patch Available (was: Open) > Add UberCompressor Serde/Codec to contrib which all

[jira] [Updated] (HIVE-2600) Enable/Add type-specific compression for rcfile

2011-12-08 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2600: Status: Patch Available (was: Open) > Enable/Add type-specific compression for rcf

[jira] [Commented] (HIVE-2604) Add UberCompressor Serde/Codec to contrib which allows per-column compression strategies

2011-12-08 Thread Krishna Kumar (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13165815#comment-13165815 ] Krishna Kumar commented on HIVE-2604: - :) I used the word Uber, not in the sens

[jira] [Updated] (HIVE-2604) Add UberCompressor Serde/Codec to contrib which allows per-column compression strategies

2011-12-17 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2604: Attachment: HIVE-2604.v2.patch > Add UberCompressor Serde/Codec to contrib which allows

[jira] [Updated] (HIVE-2623) Add Integer type compressors

2011-12-22 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2623: Component/s: (was: Query Processor) (was: Serializers/Deserializers

[jira] [Updated] (HIVE-2604) Add UberCompressor Serde/Codec to contrib which allows per-column compression strategies

2011-12-22 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2604: Component/s: (was: Query Processor) (was: Serializers/Deserializers

[jira] [Updated] (HIVE-2623) Add Integer type compressors

2011-12-22 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2623: Attachment: HIVE-2623.v2.patch Added another simple integer coder (unary) and the four variants of

[jira] [Updated] (HIVE-2623) Add Integer type compressors

2011-12-22 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2623: Attachment: (was: steppedpowerlaw1S.gz) > Add Integer type compress

[jira] [Updated] (HIVE-2623) Add Integer type compressors

2011-12-22 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2623: Attachment: (was: steppedpowerlaw2S.gz) > Add Integer type compress

[jira] [Updated] (HIVE-2623) Add Integer type compressors

2011-12-22 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2623: Attachment: (was: steppedpowerlawIQ.gz) > Add Integer type compress

[jira] [Updated] (HIVE-2623) Add Integer type compressors

2011-12-22 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2623: Attachment: data.tar.gz Artificial data sets for the 8 integer compressors. >

[jira] [Updated] (HIVE-2623) Add Integer type compressors

2011-12-22 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2623: Attachment: (was: steppedpowerlawTimestamp.gz) > Add Integer type compress

[jira] [Commented] (HIVE-2623) Add Integer type compressors

2011-12-22 Thread Krishna Kumar (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13174824#comment-13174824 ] Krishna Kumar commented on HIVE-2623: - Compression efficiency stats (unary is opt

[jira] [Updated] (HIVE-2623) Add Integer type compressors

2011-12-22 Thread Krishna Kumar (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kumar updated HIVE-2623: Status: Patch Available (was: Open) > Add Integer type compress

[jira] [Created] (HIVE-2692) Add compressors for qualitative data types

2012-01-04 Thread Krishna Kumar (Created) (JIRA)
Reporter: Krishna Kumar Assignee: Krishna Kumar Priority: Minor Any data type (int, string et al) for which the frequency of occurrences can be counted (a.k.a. categorical or enumeration types), can be compressed via methods such as huffman, arithmetic and ppm compressors

[jira] [Commented] (HIVE-2623) Add Integer type compressors

2012-02-13 Thread Krishna Kumar (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13206967#comment-13206967 ] Krishna Kumar commented on HIVE-2623: - I will attach cpu usage information

<    1   2