Sergey Shelukhin created HIVE-5657:
--------------------------------------

             Summary: TopN produces incorrect results with count(distinct)
                 Key: HIVE-5657
                 URL: https://issues.apache.org/jira/browse/HIVE-5657
             Project: Hive
          Issue Type: Bug
            Reporter: Sergey Shelukhin
            Priority: Critical
         Attachments: example.patch

Attached patch illustrates the problem.
limit_pushdown test has various other cases of aggregations and distincts, 
incl. count-distinct, that work correctly (that said, src dataset is bad for 
testing these things because every count, for example, produces one record 
only), so something must be special about this.
I am not very familiar with distinct- code and these nuances; if someone knows 
a quick fix feel free to take this, otherwise I will probably start looking 
next week. 




--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to