Yi Zhang created HIVE-26447: ------------------------------- Summary: Vectorization: wrong results when filter on repeating map key Key: HIVE-26447 URL: https://issues.apache.org/jira/browse/HIVE-26447 Project: Hive Issue Type: Bug Components: Hive Affects Versions: 3.1.3, 4.0.0 Reporter: Yi Zhang Assignee: Yi Zhang
Example reproducible case: set hive.vectorized.execution.enabled=true; set hive.fetch.task.conversion=none; create temporary table foo (id int, x map<string,int>) stored as orc; insert into foo values(1, map('ABC', 9)), (2, map('ABC', 7)), (3, map('ABC', 8)), (4, map('ABC', 9)); select id from foo where x['ABC']=9; this only gives 1, when correct result should be 1,4 For every VectorizedRowBatch, only the first row is checked. This seems to be a corner case of ORC table have repeating string type key for map field in the MapColumnVector. -- This message was sent by Atlassian Jira (v8.20.10#820010)