[ 
https://issues.apache.org/jira/browse/HIVE-6143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13862826#comment-13862826
 ] 

Timothy Chen commented on HIVE-6143:
------------------------------------

I can refactor the Hive code, but I would like some feedback on whether this 
would be an accepted approach.

> Refactor Orc file format parsing logic to be shared
> ---------------------------------------------------
>
>                 Key: HIVE-6143
>                 URL: https://issues.apache.org/jira/browse/HIVE-6143
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Timothy Chen
>
> Currently the Orc file format parsing logic is hidden in private methods in 
> reader and record reader classes, for example footer parsing, stream loading, 
> etc.
> For the Orc file format to be more reusable outside of Hive, I suggest 
> refactoring this generic logic into a shared class.
> The current interface, which reads rows serialized as objects, is not 
> sufficient: for columnar execution engines such as Drill/Impala, it is much 
> more efficient to load columnar data directly into their own columnar 
> in-memory formats.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
