Yi Zhang created HIVE-26447:
-------------------------------

             Summary: Vectorization: wrong results when filter on repeating map 
key
                 Key: HIVE-26447
                 URL: https://issues.apache.org/jira/browse/HIVE-26447
             Project: Hive
          Issue Type: Bug
          Components: Hive
    Affects Versions: 3.1.3, 4.0.0
            Reporter: Yi Zhang
            Assignee: Yi Zhang


Example reproducible case:

 

set hive.vectorized.execution.enabled=true;

set hive.fetch.task.conversion=none;

create temporary table foo (id int, x map<string,int>) stored as orc;
insert into foo values(1, map('ABC', 9)), (2, map('ABC', 7)), (3, map('ABC', 
8)), (4, map('ABC', 9));

select id from foo where x['ABC']=9;

this only gives 1, when correct result should be 1,4

For every VectorizedRowBatch, only the first row is checked.  

This seems to be a corner case of ORC table have repeating string type key for 
map field in the MapColumnVector.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to