Marcell Szabo created HIVE-13474:
------------------------------------

             Summary: avro partitions with common prefix get mixed up
                 Key: HIVE-13474
                 URL: https://issues.apache.org/jira/browse/HIVE-13474
             Project: Hive
          Issue Type: Bug
          Components: Query Processor
    Affects Versions: 1.1.0
            Reporter: Marcell Szabo


If a query reads two Avro tables and the path of one table is a string 
prefix of the other table's path (e.g. tableA and tableAbc), the processing 
of the two tables gets mixed up: input splits of tableAbc are read while 
processing tableA.

Symptom, e.g.
 INFO [main] org.apache.hadoop.hive.ql.io.avro.AvroGenericRecordReader: 
Matching partition hdfs://nameservice1/foo/BAR.tableA with input split 
hdfs://nameservice1/foo/BAR.tableAbc/000008_0:0+134217728 

The problem seems to be in
https://github.com/apache/hive/blob/26b5c7b56a4f28ce3eabc0207566cce46b29b558/ql/src/java/org/apache/hadoop/hive/ql/io/avro/AvroGenericRecordReader.java#L114

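pathIsInPartition() produces a false match because partitionPath does not 
end with a slash, so a plain prefix comparison also accepts sibling 
directories whose names merely begin with the partition's name.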
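
A minimal sketch of the suspected false match, using plain strings rather 
than the real Hive/Hadoop classes. The class name, both helper methods and 
the file name 000001_0 are hypothetical and exist only for illustration; 
the two paths with 000008_0 are taken from the log line above. This shows 
one possible way to make the check separator-aware, not necessarily the fix 
that should go into AvroGenericRecordReader.

```java
class PartitionPrefixCheck {

    // Hypothetical stand-in for the suspected behavior: a bare prefix test.
    static boolean naiveMatch(String splitPath, String partitionPath) {
        return splitPath.startsWith(partitionPath);
    }

    // Hypothetical alternative: only match when the partition path is
    // followed by a path separator (or the two paths are identical).
    static boolean separatorAwareMatch(String splitPath, String partitionPath) {
        String dir = partitionPath.endsWith("/") ? partitionPath : partitionPath + "/";
        return splitPath.equals(partitionPath) || splitPath.startsWith(dir);
    }

    public static void main(String[] args) {
        String partition = "hdfs://nameservice1/foo/BAR.tableA";
        String foreignSplit = "hdfs://nameservice1/foo/BAR.tableAbc/000008_0";
        String ownSplit = "hdfs://nameservice1/foo/BAR.tableA/000001_0";

        // The bare prefix test wrongly accepts a split of tableAbc.
        System.out.println(naiveMatch(foreignSplit, partition));          // true
        // The separator-aware test rejects it but still accepts tableA's own split.
        System.out.println(separatorAwareMatch(foreignSplit, partition)); // false
        System.out.println(separatorAwareMatch(ownSplit, partition));     // true
    }
}
```

Appending the separator before comparing keeps the check a cheap string 
operation while ruling out sibling directories such as BAR.tableAbc.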



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
