Prasanth J created HIVE-7589:
--------------------------------

             Summary: Some fixes and improvements to statistics annotation rules
                 Key: HIVE-7589
                 URL: https://issues.apache.org/jira/browse/HIVE-7589
             Project: Hive
          Issue Type: Sub-task
    Affects Versions: 0.14.0
            Reporter: Prasanth J
            Assignee: Prasanth J


*FIXES:*
1) JOIN rule does not properly propagate the column statistics from its parent
2) Multi-way join rule computes the denominator for #rows estimation wrongly
3) GROUPBY rule does not account for the data size of aggregate column
4) Prefix removal from column names wasn't working
5) GROUPBY rule looks at missing column statistics for aggregate column from 
its parent and assumes PARTIAL column stats state

*IMPROVEMENTS:*
1) Replaced "EXPLAIN EXTENDED" with "EXPLAIN" in test cases to make the golden 
files easy to comprehend and to reduce verbosity
2) Introduced rule for ReduceSink operator which only does renaming of column 
statistics as per output row schema
3) Added more rows to the test datasets to avoid 0 row scenario in join test 
cases
4) JOIN rule improvement to avoid long overflow



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to