[ https://issues.apache.org/jira/browse/HIVE-18851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sergey Shelukhin updated HIVE-18851: ------------------------------------ Labels: ACID (was: ) > make Hive basic stats valid for ACID; clean up and refactor the code > -------------------------------------------------------------------- > > Key: HIVE-18851 > URL: https://issues.apache.org/jira/browse/HIVE-18851 > Project: Hive > Issue Type: Bug > Reporter: Sergey Shelukhin > Priority: Major > Labels: ACID > > HIVE-18571 that started as a couple small fixes for MM tables, but ends up > making stats for ACID tables work better in general, but not rigorously and > not for all cases. > This is a follow-up JIRA to implement stats for ACID properly (potentially > also with ACID semantics similar to those of queries, but that could be > another follow-up - for now, at least they should be based on the correct set > of files). > Overall I've discovered that Hive stats code is spread all over in random > places in code base and is brittle and inconsistent, esp. for any complex > scenario like ACID tables. > So, instead of making ad-hoc fixes everywhere, I think at the minimum it > should be moved to a single spot (so that e.g. BasicStatsTask, > BasicStatsTaskNoJob, metastore "quick" stats generation, etc all use the same > code with the same logic) and made valid for ACID. -- This message was sent by Atlassian JIRA (v7.6.3#76005)