Siddharth Seth created HIVE-12624:
-------------------------------------

             Summary: HYBRID ORC split strategy can generate inconsistent 
splits across two otherwise identical runs
                 Key: HIVE-12624
                 URL: https://issues.apache.org/jira/browse/HIVE-12624
             Project: Hive
          Issue Type: Bug
            Reporter: Siddharth Seth


When using the HYBRID strategy to execute the same query twice - the raw splits 
generated in the two runs can differ. Wondering if this has something to do 
with footer caching.

cc [~prasanth_j]

14: [ 
org.apache.hadoop.hive.ql.io.orc.OrcInputFormat:hdfs:///...date=2015-11-03/000000_0:3+367753]
 != [ 
org.apache.hadoop.hive.ql.io.orc.OrcInputFormat:hdfs://...date=2015-11-03/000000_0:0+367958]
65: [ 
org.apache.hadoop.hive.ql.io.orc.OrcInputFormat:hdfs://...date=2015-11-10/000002_0:0+387632]
 != [ 
org.apache.hadoop.hive.ql.io.orc.OrcInputFormat:hdfs://...date=2015-11-10/000002_0:3+387427]



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to