Shreepadma Venugopalan created HIVE-3516: --------------------------------------------
Summary: Fast incremental statistics computation on column in Hive tables Key: HIVE-3516 URL: https://issues.apache.org/jira/browse/HIVE-3516 Project: Hive Issue Type: Bug Components: Statistics Reporter: Shreepadma Venugopalan Assignee: Shreepadma Venugopalan Statistics computed on Hive columns in partition can be rolled up to avoid scanning the table again to compute column statistics at the table(global) level. While its straightforward to roll up some statistics such as max, min, avgcollen, maxcollen etc, rolling up other statistics such as ndv requires maintaining intermediate state. This ticket covers the task of a) maintaining the necessary intermediate state needed to roll up partition level statistics b) detecting that the partition level statistics can be rolled up and actually computing table level statistics from partition level statistics. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira