[ 
https://issues.apache.org/jira/browse/HIVE-17209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16108161#comment-16108161
 ] 

Rajesh Balamohan edited comment on HIVE-17209 at 7/31/17 11:35 PM:
-------------------------------------------------------------------

Thanks [~sershe]. This would also need a fix in ORC 
(https://github.com/apache/orc/blob/master/java/core/src/java/org/apache/orc/impl/RecordReaderImpl.java#L554).
 Since {{getLiteralList()}} would be empty, that code needs to check for an empty 
list to avoid an IndexOutOfBoundsException. I will create a separate ticket for 
that in ORC.
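
A minimal sketch of the kind of guard described above. It is not the actual ORC code: {{LiteralGuard}} and {{firstLiteralOrNull}} are hypothetical names, and a plain {{List}} stands in for whatever {{getLiteralList()}} returns. It only illustrates checking for an empty list before reading element 0.

```java
import java.util.List;

// Hypothetical helper for illustration only; not part of ORC or Hive.
final class LiteralGuard {
    private LiteralGuard() {}

    /**
     * Returns the first literal, or null when the list is null or empty.
     * Calling literals.get(0) on an empty list would throw
     * IndexOutOfBoundsException, which is what the guard avoids.
     */
    static Object firstLiteralOrNull(List<?> literals) {
        if (literals == null || literals.isEmpty()) {
            return null; // assumption: the caller treats null as "cannot evaluate"
        }
        return literals.get(0);
    }
}
```

How the caller should react to a missing literal (for example, by not eliminating the row group) is left open here and would be decided in the ORC ticket.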


was (Author: rajesh.balamohan):
Thanks [~sershe]. This would also need a fix in ORC 
(https://github.com/apache/orc/blob/master/java/core/src/java/org/apache/orc/impl/RecordReaderImpl.java#L554).
 Since {{getLiteralList()}} would be empty, it needs to check for empty 
structure to avoid IndexOutOfBoundsException. 

> ObjectCacheFactory should return null when tez shared object registry is not 
> setup
> ----------------------------------------------------------------------------------
>
>                 Key: HIVE-17209
>                 URL: https://issues.apache.org/jira/browse/HIVE-17209
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Rajesh Balamohan
>            Assignee: Rajesh Balamohan
>            Priority: Minor
>         Attachments: HIVE-17209.1.patch
>
>
> HIVE-15269 introduced the dynamic min/max bloom filter 
> ("hive.tez.dynamic.semijoin.reduction=true"). It needs access to the 
> ObjectCache, and in Tez the ObjectCache can only be created by {{TezProcessor}}.
> In the following case, {{AM --> splits --> 
> OrcInputFormat.pickStripes::evaluatePredicateMinMax --> 
> DynamicValue.getLiteral --> objectCache access}}, the AM ends up throwing many 
> NPEs because it has not created an ObjectCache.
> The ORC reader catches these exceptions, skips PPD, and proceeds. For example, 
> in Q95 it throws ~30,000 NPEs before it finishes computing split information.
> ObjectCacheFactory should return null when the Tez shared object registry is not 
> set up.
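
The proposed behaviour can be sketched roughly as follows. This is not the attached patch: every class here ({{SketchObjectCache}}, {{SketchObjectCacheFactory}}, {{SketchDynamicValue}}) is a hypothetical stand-in, and a plain {{Map}} plays the role of the Tez shared object registry. Handling the null in the caller is likewise an assumption about how the change would be consumed.

```java
import java.util.Map;

// Hypothetical stand-in for the cache interface; not a Hive or Tez class.
interface SketchObjectCache {
    Object retrieve(String key);
}

// Hypothetical factory illustrating "return null when the registry is not set up".
final class SketchObjectCacheFactory {
    // Stands in for the shared object registry; null means "not set up",
    // as would be the case inside the AM in the scenario described above.
    private static volatile Map<String, Object> registry;

    private SketchObjectCacheFactory() {}

    static void setRegistry(Map<String, Object> r) {
        registry = r;
    }

    /** Returns a cache backed by the registry, or null when no registry exists. */
    static SketchObjectCache getCache() {
        final Map<String, Object> r = registry;
        if (r == null) {
            return null; // no registry: report absence instead of failing later
        }
        return key -> r.get(key);
    }
}

// Hypothetical caller showing an explicit null check instead of a caught NPE.
final class SketchDynamicValue {
    private SketchDynamicValue() {}

    static Object getLiteral(String key) {
        SketchObjectCache cache = SketchObjectCacheFactory.getCache();
        if (cache == null) {
            return null; // assumption: the caller skips predicate evaluation here
        }
        return cache.retrieve(key);
    }
}
```

With an explicit null check, the missing cache is detected once per lookup without constructing and unwinding an exception each time.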



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
