[ https://issues.apache.org/jira/browse/HIVE-3516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13466298#comment-13466298 ]
Shreepadma Venugopalan commented on HIVE-3516:
----------------------------------------------

HIVE-1362 adds support for computing statistics on columns in tables and partitions.

> Fast incremental statistics computation on column in Hive tables
> ----------------------------------------------------------------
>
>                 Key: HIVE-3516
>                 URL: https://issues.apache.org/jira/browse/HIVE-3516
>             Project: Hive
>          Issue Type: Bug
>          Components: Statistics
>            Reporter: Shreepadma Venugopalan
>            Assignee: Shreepadma Venugopalan
>
> Statistics computed on Hive columns in partitions can be rolled up to avoid
> scanning the table again to compute column statistics at the table (global)
> level. While it's straightforward to roll up some statistics such as max, min,
> avgcollen, maxcollen, etc., rolling up other statistics such as ndv requires
> maintaining intermediate state. This ticket covers the task of a) maintaining
> the necessary intermediate state needed to roll up partition-level statistics
> and b) detecting that the partition-level statistics can be rolled up and
> actually computing table-level statistics from partition-level statistics.
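
To illustrate why ndv needs intermediate state while max, min, avgcollen, and maxcollen do not, here is a minimal sketch in Python (not Hive's Java). Everything in it is an assumption for illustration: the ticket does not say which intermediate state Hive keeps, so this sketch uses a K-minimum-values (KMV) summary as one possible mergeable structure, and the names `PartitionColumnStats`, `compute_partition_stats`, `roll_up`, and the constant `K` are hypothetical. It also assumes non-empty partitions of non-null string values.

```python
import hashlib
from dataclasses import dataclass, field

K = 256               # hypothetical sketch size: smallest hash values kept per partition
HASH_SPACE = 2 ** 64  # hash64() returns integers in [0, 2**64)


def hash64(value) -> int:
    """Map a value to a stable 64-bit integer."""
    digest = hashlib.sha1(repr(value).encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


@dataclass
class PartitionColumnStats:
    num_rows: int
    min_value: object
    max_value: object
    max_col_len: int
    avg_col_len: float
    # Intermediate state for ndv: the K smallest distinct hashes, sorted.
    kmv: list = field(default_factory=list)


def compute_partition_stats(values) -> PartitionColumnStats:
    """Scan one partition's column once (assumed non-empty, non-null strings)."""
    lengths = [len(v) for v in values]
    return PartitionColumnStats(
        num_rows=len(values),
        min_value=min(values),
        max_value=max(values),
        max_col_len=max(lengths),
        avg_col_len=sum(lengths) / len(lengths),
        kmv=sorted({hash64(v) for v in values})[:K],
    )


def estimate_ndv(kmv) -> int:
    """Estimate the number of distinct values from a KMV summary."""
    if len(kmv) < K:
        # Fewer than K distinct hashes: the count is exact, barring collisions.
        return len(kmv)
    # Standard KMV estimator: (K - 1) / (K-th smallest hash, normalized to [0, 1)).
    return round((K - 1) * HASH_SPACE / kmv[-1])


def roll_up(parts) -> dict:
    """Compute table-level statistics from partition-level ones, with no rescan."""
    total_rows = sum(p.num_rows for p in parts)
    # Merging KMV summaries: union the hashes, then keep the K smallest again.
    merged = sorted(set().union(*(p.kmv for p in parts)))[:K]
    return {
        "min": min(p.min_value for p in parts),
        "max": max(p.max_value for p in parts),
        "maxcollen": max(p.max_col_len for p in parts),
        # The average must be weighted by each partition's row count.
        "avgcollen": sum(p.avg_col_len * p.num_rows for p in parts) / total_rows,
        "ndv": estimate_ndv(merged),
    }
```

With two partitions holding `["apple", "kiwi", "fig"]` and `["kiwi", "banana", "apple", "cherry"]`, the per-partition distinct counts are 3 and 4, but the table has only 5 distinct values. Adding the counts overcounts the shared values, and neither partition's count alone says how much the two overlap; merging the retained hashes recovers the overlap, which is the kind of intermediate state the description refers to.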