[ https://issues.apache.org/jira/browse/HIVE-6143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13862826#comment-13862826 ]
Timothy Chen commented on HIVE-6143: ------------------------------------ I can refactor the hive code but would like to get some feedback if this can be an accepted approach, > Refactor Orc file format parsing logic to be shared > --------------------------------------------------- > > Key: HIVE-6143 > URL: https://issues.apache.org/jira/browse/HIVE-6143 > Project: Hive > Issue Type: Bug > Reporter: Timothy Chen > > Currently the Orc file format parsing logic is hidden in private methods in > reader and record reader classes, for example footer parsing, stream loading, > etc. > For the Orc file format to be a more reusable file format outside of Hive, I > suggest refactor these generic logic into a shared class. > The current interface of reading per serialized as objects is not suffice as > for columnar execution engines such as Drill/Impala, it's much more efficient > to load in columnar data into its own columnar in memory formats. -- This message was sent by Atlassian JIRA (v6.1.5#6160)