Jesus Camacho Rodriguez created HIVE-13089:
----------------------------------------------

             Summary: Rounding in Stats for equality expressions
                 Key: HIVE-13089
                 URL: https://issues.apache.org/jira/browse/HIVE-13089
             Project: Hive
          Issue Type: Bug
          Components: Statistics
    Affects Versions: 2.1.0
            Reporter: Jesus Camacho Rodriguez
            Assignee: Jesus Camacho Rodriguez


Currently we divide numRows (long) by countDistinct (long) using integer 
division, which truncates the fractional part of the result. We should round 
the result properly instead.

This is especially useful for equality expressions over columns whose values 
are unique. Because NDV estimates allow for a certain error, countDistinct can 
exceed numRows, in which case the truncated division yields an estimate of 0 
rows for the expression.
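
To illustrate the arithmetic, here is a minimal sketch in Java. It is not the 
actual Hive code: the class name, method names, and sample values are 
hypothetical, and it assumes countDistinct > 0.

```java
// Hypothetical illustration only; not taken from the Hive code base.
final class EqualityRowEstimate {
    private EqualityRowEstimate() {}

    // long / long is integer division: the fractional part is discarded.
    // Assumes countDistinct > 0.
    static long truncated(long numRows, long countDistinct) {
        return numRows / countDistinct;
    }

    // Divide in floating point, then round to the nearest long.
    // Assumes countDistinct > 0.
    static long rounded(long numRows, long countDistinct) {
        return Math.round((double) numRows / countDistinct);
    }

    public static void main(String[] args) {
        long numRows = 1000L;
        // Assumed NDV estimate that slightly overshoots the true value (1000).
        long countDistinct = 1003L;
        System.out.println(truncated(numRows, countDistinct)); // prints 0
        System.out.println(rounded(numRows, countDistinct));   // prints 1
    }
}
```

With a unique column of 1000 rows and an NDV estimate of 1003, truncation 
gives 0 rows while rounding gives the expected 1 row.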



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)