[ https://issues.apache.org/jira/browse/HIVE-4123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13719083#comment-13719083 ]
Prasanth J commented on HIVE-4123: ---------------------------------- Updated the patch with bug fix in patched base encoding. Formatting changes fixed in this patch. Added more test cases for patched base encoding that covers more edge cases. Also changes to TestFileDump has been removed, since the memory memory chooses stripe size based on available jvm memory which I vary for different test cases. > The RLE encoding for ORC can be improved > ---------------------------------------- > > Key: HIVE-4123 > URL: https://issues.apache.org/jira/browse/HIVE-4123 > Project: Hive > Issue Type: New Feature > Components: File Formats > Reporter: Owen O'Malley > Assignee: Prasanth J > Attachments: HIVE-4123.1.git.patch.txt, HIVE-4123.2.git.patch.txt, > HIVE-4123.3.patch.txt, HIVE-4123.4.patch.txt, > ORC-Compression-Ratio-Comparison.xlsx > > > The run length encoding of integers can be improved: > * tighter bit packing > * allow delta encoding > * allow longer runs -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira