Steve Carlin created HIVE-26671: ----------------------------------- Summary: Incorrect results for group by/order by/limit query with 2 aggregates Key: HIVE-26671 URL: https://issues.apache.org/jira/browse/HIVE-26671 Project: Hive Issue Type: Bug Components: Operators Reporter: Steve Carlin
Grabbed this query from the Impala test suite. It is a query run off of tpcds tables, but it's not really super special. You will need a lot of data to reproduce this, though. select l_orderkey, min(l_shipdate) as flt, count(distinct l_partkey) as cnl from lineitem group by l_orderkey order by l_orderkey limit 2; The issue is with the Top N Key operator optimizer. The Top N Key operator is the first operator after the Table Scan. The sort key is on both the l_orderkey and l_partkey columns, but this means that the second sort key might not be forwarded. -- This message was sent by Atlassian Jira (v8.20.10#820010)