[ https://issues.apache.org/jira/browse/HIVE-26676?focusedWorklogId=823763&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-823763 ]
ASF GitHub Bot logged work on HIVE-26676:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 06/Nov/22 19:16
            Start Date: 06/Nov/22 19:16
    Worklog Time Spent: 10m
      Work Description: amansinha100 commented on code in PR #3734:
URL: https://github.com/apache/hive/pull/3734#discussion_r1014878611


##########
ql/src/test/queries/clientpositive/count_distinct_subquery.q:
##########
@@ -0,0 +1,32 @@
+--! qt:dataset:src
+SET hive.mapred.mode=nonstrict;
+SET hive.vectorized.execution.enabled=false;
+SET hive.optimize.topnkey=true;
+
+SET hive.optimize.ppd=true;
+SET hive.ppd.remove.duplicatefilters=true;
+SET hive.tez.dynamic.partition.pruning=true;
+SET hive.optimize.metadataonly=false;
+SET hive.optimize.index.filter=true;
+SET hive.tez.min.bloom.filter.entries=1;
+
+SET hive.stats.fetch.column.stats=true;
+SET hive.cbo.enable=true;
+
+SET hive.optimize.topnkey=true;

Review Comment:
   Are all these settings needed ? Why not use the defaults ?


Issue Time Tracking
-------------------

    Worklog Id:     (was: 823763)
    Time Spent: 0.5h  (was: 20m)

> Count distinct in subquery returning wrong results
> --------------------------------------------------
>
>                 Key: HIVE-26676
>                 URL: https://issues.apache.org/jira/browse/HIVE-26676
>             Project: Hive
>          Issue Type: Bug
>          Components: HiveServer2
>            Reporter: Steve Carlin
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> The following select query should return 1 row
> {code:java}
> CREATE TABLE t_test(
>   a tinyint
> );
> INSERT INTO t_test VALUES
> (0), (1), (2), (3), (4), (5), (6), (7), (8), (9);
> select 1 from (select count(distinct a) from t_test) x;
> {code}
> But it's returning 10 rows
>


--
This message was sent by Atlassian Jira
(v8.20.10#820010)
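
The expected result quoted in the issue (1 row, not 10) follows from standard SQL semantics: an aggregate without GROUP BY yields exactly one row, so the outer select over it yields one row as well. As a rough cross-check, the sketch below runs the same statements against SQLite through Python's sqlite3 module. SQLite is only a stand-in engine here and says nothing about Hive's planner; the function name count_rows_from_distinct_subquery is hypothetical and not part of the issue or the PR.

```python
import sqlite3


def count_rows_from_distinct_subquery():
    """Run the reproduction from the issue in an in-memory SQLite database.

    Assumption: SQLite stands in for Hive purely to illustrate the
    expected row count; it does not exercise any Hive code path.
    """
    conn = sqlite3.connect(":memory:")
    try:
        # Same table shape and data as in the issue description.
        conn.execute("CREATE TABLE t_test (a tinyint)")
        conn.executemany(
            "INSERT INTO t_test VALUES (?)", [(i,) for i in range(10)]
        )
        # The inner aggregate has no GROUP BY, so it produces one row;
        # the outer query therefore projects the literal 1 exactly once.
        return conn.execute(
            "select 1 from (select count(distinct a) from t_test) x"
        ).fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    print(len(count_rows_from_distinct_subquery()))
```

A result of ten rows, as reported, would mean the outer projection was evaluated once per input row of t_test rather than once per aggregate result.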