[ https://issues.apache.org/jira/browse/HIVE-28578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ASF GitHub Bot updated HIVE-28578: ---------------------------------- Labels: pull-request-available (was: ) > Concurrency issue in updateTableColumnStatistics > ------------------------------------------------ > > Key: HIVE-28578 > URL: https://issues.apache.org/jira/browse/HIVE-28578 > Project: Hive > Issue Type: Bug > Security Level: Public(Viewable by anyone) > Components: Standalone Metastore > Reporter: Zsolt Miskolczi > Assignee: Zsolt Miskolczi > Priority: Major > Labels: pull-request-available > Attachments: Repro.java > > > Summary: > updateTableColumnStatistics can throw > SQLIntegrityConstraintViolationException during replication if HA is on and > two different HMS instance gets the same call but with different engine. > > Workaround: > Update table column statistics in single threaded. > > Details: > updateTableColumnStatistics has a relative long running transaction. In that > transaction, it validates the actual parameters, queries the metastore db > against the TABLE_PARAMS that are already stored and makes a decision based > on that. After this, it uses data nucleus to persist the new statistics. > From the two HMS instances, one can save the column statistics. And the other > cannot as the first instance already saved them. > -- This message was sent by Atlassian Jira (v8.20.10#820010)