[ 
https://issues.apache.org/jira/browse/HIVE-13828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15297816#comment-15297816
 ] 

Gopal V commented on HIVE-13828:
--------------------------------

-0 on the idea in general.

Mostly found that this caused OOMs during split generation, since the AMs which 
open > 1000+ files spend a significant amount of memory serializing these 
footer objects and that ends up in the recovery information for Tez.

> Enable hive.orc.splits.include.file.footer by default
> -----------------------------------------------------
>
>                 Key: HIVE-13828
>                 URL: https://issues.apache.org/jira/browse/HIVE-13828
>             Project: Hive
>          Issue Type: Improvement
>          Components: Hive
>            Reporter: Rajesh Balamohan
>            Priority: Minor
>
> As a part of setting up the OrcInputFormat.getRecordReader in the task side, 
> hive ends up opening the file path and reads the metadata information. If  
> hive.orc.splits.include.file.footer=true, this metadata info can be passed on 
> to task side which can help reduce the overhead.  It would be good to 
> consider enabling this parameter by default.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to