Prasanth J created HIVE-7589: -------------------------------- Summary: Some fixes and improvements to statistics annotation rules Key: HIVE-7589 URL: https://issues.apache.org/jira/browse/HIVE-7589 Project: Hive Issue Type: Sub-task Affects Versions: 0.14.0 Reporter: Prasanth J Assignee: Prasanth J
*FIXES:* 1) JOIN rule does not properly propagate the column statistics from its parent 2) Multi-way join rule computes the denominator for #rows estimation wrongly 3) GROUPBY rule does not account for the data size of aggregate column 4) Prefix removal from column names wasn't working 5) GROUPBY rule looks at missing column statistics for aggregate column from its parent and assumes PARTIAL column stats state *IMPROVEMENTS:* 1) Replaced "EXPLAIN EXTENDED" with "EXPLAIN" in test cases to make the golden files easy to comprehend and to reduce verbosity 2) Introduced rule for ReduceSink operator which only does renaming of column statistics as per output row schema 3) Added more rows to the test datasets to avoid 0 row scenario in join test cases 4) JOIN rule improvement to avoid long overflow -- This message was sent by Atlassian JIRA (v6.2#6252)