[ https://issues.apache.org/jira/browse/HIVE-16869?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16045300#comment-16045300 ]
Hive QA commented on HIVE-16869: -------------------------------- Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12872361/HIVE-16869.2.patch {color:green}SUCCESS:{color} +1 due to 2 test(s) being added or modified. {color:red}ERROR:{color} -1 due to 6 failed/errored test(s), 10832 tests executed *Failed tests:* {noformat} org.apache.hadoop.hive.cli.TestMiniLlapCliDriver.testCliDriver[orc_ppd_basic] (batchId=140) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[vector_if_expr] (batchId=145) org.apache.hadoop.hive.cli.TestMiniTezCliDriver.testCliDriver[explainanalyze_2] (batchId=99) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query14] (batchId=232) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query23] (batchId=232) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query78] (batchId=232) {noformat} Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/5611/testReport Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/5611/console Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-5611/ Messages: {noformat} Executing org.apache.hive.ptest.execution.TestCheckPhase Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 6 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12872361 - PreCommit-HIVE-Build > Hive returns wrong result when predicates on non-existing columns are pushed > down to Parquet reader > --------------------------------------------------------------------------------------------------- > > Key: HIVE-16869 > URL: https://issues.apache.org/jira/browse/HIVE-16869 > Project: Hive > Issue Type: Bug > Reporter: Yibing Shi > Assignee: Yibing Shi > Priority: Critical > Attachments: HIVE-16869.1.patch, HIVE-16869.2.patch > > > When {{hive.optimize.ppd}} and {{hive.optimize.index.filter}} are turned, and > a select query has a condition on a column that doesn't exist in Parquet file > (such as a partition column), Hive often returns wrong result. > Please see below example for details: > {noformat} > hive> create table test_parq (a int, b int) partitioned by (p int) stored as > parquet; > OK > Time taken: 0.292 seconds > hive> insert overwrite table test_parq partition (p=1) values (1, 2); > OK > Time taken: 5.08 seconds > hive> select * from test_parq where a=1 and p=1; > OK > 1 2 1 > Time taken: 0.441 seconds, Fetched: 1 row(s) > hive> select * from test_parq where (a=1 and p=1) or (a=999 and p=999); > OK > 1 2 1 > Time taken: 0.197 seconds, Fetched: 1 row(s) > hive> set hive.optimize.index.filter=true; > hive> select * from test_parq where (a=1 and p=1) or (a=999 and p=999); > OK > Time taken: 0.167 seconds > hive> select * from test_parq where (a=1 or a=999) and (a=999 or p=1); > OK > Time taken: 0.563 seconds > {noformat} -- This message was sent by Atlassian JIRA (v6.3.15#6346)