[
https://issues.apache.org/jira/browse/HIVE-2417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2417:
Status: Patch Available (was: Open)
> Merging of compressed rcfiles fails to write the valuebuf
[
https://issues.apache.org/jira/browse/HIVE-2413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2413:
Assignee: Krishna Kumar
Status: Patch Available (was: Open)
> BlockMergeTask ignores cli
[
https://issues.apache.org/jira/browse/HIVE-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2097:
Attachment: datacomp.tar.gz
unrefactored source for all the implemented compression codecs
[
https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438434#comment-13438434
]
Krishna Kumar commented on HIVE-2623:
-
Hi Rao,
I have not been able to do
, Serializers/Deserializers
Reporter: Krishna Kumar
Assignee: Krishna Kumar
Priority: Minor
Enable schema-aware compression codecs which can perform type-specific
compression on a per-column basis. I see this as in three-parts
1. Add interfaces for the rcfile to
[
https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2600:
Description:
Enable schema-aware compression codecs which can perform type-specific
compression on
[
https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2600:
Attachment: HIVE-2600.v0.patch
Added two interfaces and invocations at appropriate places
[
https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2600:
Status: Patch Available (was: Open)
> Enable/Add type-specific compression for rcf
[
https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13155228#comment-13155228
]
Krishna Kumar commented on HIVE-2600:
-
This should be usable for any rcfile data.
Project: Hive
Issue Type: Sub-task
Reporter: Krishna Kumar
The strategies supported are
1. using a specified codec on the column
2. using a specific codec on the column which is serialized via a specific serde
3. using a specific "TypeSpecificCompressor" instanc
[
https://issues.apache.org/jira/browse/HIVE-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2604:
Attachment: HIVE-2604.v0.patch
initial version of the patch.
> Add UberCompres
[
https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13156006#comment-13156006
]
Krishna Kumar commented on HIVE-2600:
-
Added the code for uber codec/serde to
[
https://issues.apache.org/jira/browse/HIVE-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar reassigned HIVE-2604:
---
Assignee: Krishna Kumar
> Add UberCompressor Serde/Codec to contrib which allows
[
https://issues.apache.org/jira/browse/HIVE-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13156758#comment-13156758
]
Krishna Kumar commented on HIVE-2604:
-
Sure. I'll add some of the compressors
[
https://issues.apache.org/jira/browse/HIVE-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13158484#comment-13158484
]
Krishna Kumar commented on HIVE-2097:
-
Thanks Alex for the suggestions.
Just t
[
https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2600:
Attachment: HIVE-2600.v1.patch
Added logging to report every column's total le
[
https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13159122#comment-13159122
]
Krishna Kumar commented on HIVE-2600:
-
I have chosen to record that information in
[
https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13159734#comment-13159734
]
Krishna Kumar commented on HIVE-2600:
-
That is part of the compression codec
Add Integer type compressors
Key: HIVE-2623
URL: https://issues.apache.org/jira/browse/HIVE-2623
Project: Hive
Issue Type: Sub-task
Reporter: Krishna Kumar
Assignee: Krishna Kumar
[
https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2623:
Attachment: HIVE-2623.v0.patch
Elias gamma for a two-sided distribution. Draft only - tests and one
[
https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2623:
Attachment: steppedpowerlaw.gz
Synthetic data for elias gamma.
Storage size when stored as rc
[
https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13162705#comment-13162705
]
Krishna Kumar commented on HIVE-2600:
-
He Yongqiang,
In UberCompressor, I have
[
https://issues.apache.org/jira/browse/HIVE-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13162709#comment-13162709
]
Krishna Kumar commented on HIVE-2604:
-
The current implementation works as fol
[
https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2623:
Attachment: (was: steppedpowerlaw.gz)
> Add Integer type compress
[
https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2623:
Attachment: steppedpowerlawTimestamp.gz
steppedpowerlawIQ.gz
[
https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2623:
Attachment: HIVE-2623.v1.patch
Added 3 three more elias gamma compressors - so now supporting one
[
https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13163437#comment-13163437
]
Krishna Kumar commented on HIVE-2623:
-
RCFile size stats:
gzip/bzip2/
[
https://issues.apache.org/jira/browse/HIVE-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2604:
Attachment: HIVE-2604.v1.patch
Serde wraps lazysimpleserde to make it more similar to columnarserde
[
https://issues.apache.org/jira/browse/HIVE-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2604:
Status: Patch Available (was: Open)
> Add UberCompressor Serde/Codec to contrib which all
[
https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2600:
Status: Patch Available (was: Open)
> Enable/Add type-specific compression for rcf
[
https://issues.apache.org/jira/browse/HIVE-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13165815#comment-13165815
]
Krishna Kumar commented on HIVE-2604:
-
:) I used the word Uber, not in the sens
[
https://issues.apache.org/jira/browse/HIVE-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2604:
Attachment: HIVE-2604.v2.patch
> Add UberCompressor Serde/Codec to contrib which allows
[
https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2623:
Component/s: (was: Query Processor)
(was: Serializers/Deserializers
[
https://issues.apache.org/jira/browse/HIVE-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2604:
Component/s: (was: Query Processor)
(was: Serializers/Deserializers
[
https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2623:
Attachment: HIVE-2623.v2.patch
Added another simple integer coder (unary) and the four variants of
[
https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2623:
Attachment: (was: steppedpowerlaw1S.gz)
> Add Integer type compress
[
https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2623:
Attachment: (was: steppedpowerlaw2S.gz)
> Add Integer type compress
[
https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2623:
Attachment: (was: steppedpowerlawIQ.gz)
> Add Integer type compress
[
https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2623:
Attachment: data.tar.gz
Artificial data sets for the 8 integer compressors.
>
[
https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2623:
Attachment: (was: steppedpowerlawTimestamp.gz)
> Add Integer type compress
[
https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13174824#comment-13174824
]
Krishna Kumar commented on HIVE-2623:
-
Compression efficiency stats (unary is opt
[
https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krishna Kumar updated HIVE-2623:
Status: Patch Available (was: Open)
> Add Integer type compress
Reporter: Krishna Kumar
Assignee: Krishna Kumar
Priority: Minor
Any data type (int, string et al) for which the frequency of occurrences can be
counted (a.k.a. categorical or enumeration types), can be compressed via
methods such as huffman, arithmetic and ppm compressors
[
https://issues.apache.org/jira/browse/HIVE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13206967#comment-13206967
]
Krishna Kumar commented on HIVE-2623:
-
I will attach cpu usage information
101 - 144 of 144 matches
Mail list logo