[ https://issues.apache.org/jira/browse/HIVE-21917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16975574#comment-16975574 ]
Hive QA commented on HIVE-21917:
--------------------------------

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12985968/HIVE-21917.2.patch

{color:green}SUCCESS:{color} +1 due to 1 test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 3 failed/errored test(s), 17706 tests executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.ql.TestWarehouseExternalDir.testManagedPaths (batchId=276)
org.apache.hive.jdbc.TestSSL.testMetastoreWithSSL (batchId=284)
org.apache.hive.service.TestHS2ImpersonationWithRemoteMS.testImpersonation (batchId=283)
{noformat}

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/19449/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/19449/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-19449/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.YetusPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 3 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12985968 - PreCommit-HIVE-Build

> COMPLETED_TXN_COMPONENTS table is never cleaned up unless Compactor runs
> ------------------------------------------------------------------------
>
>                 Key: HIVE-21917
>                 URL: https://issues.apache.org/jira/browse/HIVE-21917
>             Project: Hive
>          Issue Type: Bug
>          Components: Transactions
>    Affects Versions: 3.1.0, 3.1.1
>            Reporter: Craig Condit
>            Assignee: Denys Kuzmenko
>            Priority: Major
>         Attachments: HIVE-21917.1.patch, HIVE-21917.2.patch
>
>
> The Initiator thread in the metastore repeatedly loops over entries in the
> COMPLETED_TXN_COMPONENTS table to determine which partitions / tables might
> need to be compacted. However, entries are never removed from this table
> except by a completed Compactor run.
> In a cluster where most tables / partitions are write-once read-many, this
> results in stale entries in this table never being cleaned up. In a small
> test cluster, we have observed approximately 45k entries in this table
> (virtually equal to the number of partitions in the cluster) while < 100 of
> these tables have delta files at all. Since most of the tables will never get
> enough writes to trigger a compaction (and in fact have only ever been
> written to once), the initiator thread keeps trying to evaluate them on every
> loop.
> On this test cluster, it takes approximately 10 minutes to loop through all
> the entries and results in severe performance degradation on metastore
> operations. With the default run timing of 5 minutes, the initiator basically
> never stops running.
> On a production cluster with 2M partitions, this would be a non-starter.
> The initiator thread should proactively remove entries from
> COMPLETED_TXN_COMPONENTS when it determines that a compaction is not needed,
> so that they are not evaluated again on the next loop.
>

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
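
For readers following the issue, the behaviour proposed in the description (the Initiator dropping entries it has decided not to compact) might be sketched roughly as below. This is an in-memory illustration only, not the code in the attached patches: the class InitiatorSketch, the Entry type, the runOnce method, and the DELTA_THRESHOLD value are all hypothetical names and assumptions introduced for this example, and a plain list stands in for the COMPLETED_TXN_COMPONENTS table.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

public class InitiatorSketch {

    /** Hypothetical stand-in for one row of COMPLETED_TXN_COMPONENTS. */
    static final class Entry {
        final String table;
        final String partition;
        final int deltaCount; // number of delta files observed for this partition

        Entry(String table, String partition, int deltaCount) {
            this.table = table;
            this.partition = partition;
            this.deltaCount = deltaCount;
        }
    }

    // Assumed threshold for this sketch; the real decision uses more inputs.
    static final int DELTA_THRESHOLD = 10;

    /** Simplified, assumed check for whether a compaction is needed. */
    static boolean needsCompaction(Entry e) {
        return e.deltaCount >= DELTA_THRESHOLD;
    }

    /**
     * One pass of the loop. Entries that need compaction are returned and stay
     * in the list (in the issue, a completed Compactor run removes them later).
     * Entries that do not need compaction are removed immediately, so the next
     * pass does not evaluate them again.
     */
    static List<Entry> runOnce(List<Entry> completedTxnComponents) {
        List<Entry> toCompact = new ArrayList<>();
        Iterator<Entry> it = completedTxnComponents.iterator();
        while (it.hasNext()) {
            Entry e = it.next();
            if (needsCompaction(e)) {
                toCompact.add(e);
            } else {
                it.remove(); // proactive cleanup proposed in the description
            }
        }
        return toCompact;
    }

    public static void main(String[] args) {
        List<Entry> rows = new ArrayList<>();
        rows.add(new Entry("sales", "day=1", 1));   // written once
        rows.add(new Entry("sales", "day=2", 1));   // written once
        rows.add(new Entry("events", "day=1", 12)); // many deltas

        List<Entry> queued = runOnce(rows);
        System.out.println("queued for compaction: " + queued.size());
        System.out.println("entries left for the next pass: " + rows.size());
    }
}
```

With this shape, the cost of each pass tracks the number of partitions that are still candidates for compaction rather than the total number of partitions ever written. Whether it is always safe to drop an entry at that point depends on details of the transaction tables that this sketch does not model.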