[ https://issues.apache.org/jira/browse/HIVE-22755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17020832#comment-17020832 ]
Peter Vary commented on HIVE-22755:
-----------------------------------

I have 2 ideas:
* Time-based approach: use the start time of the minOpenTxn to filter the files to clean.
* Rather than skipping every read lock, create one read lock per table instead of one per partition. This would fix some of the failures introduced by the previous patch, with the added benefit of knowing which tables are being read.

CC: [~lpinter], [~dkuzmenko]

> Cleaner/Compaction can skip the read locks and use the min open txn id
> ----------------------------------------------------------------------
>
>                 Key: HIVE-22755
>                 URL: https://issues.apache.org/jira/browse/HIVE-22755
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Transactions
>            Reporter: Slim Bouguerra
>            Priority: Major
>             Fix For: 4.0.0
>
>
> The minOpenTxnId is used by the Cleaner here
> https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/txn/compactor/Cleaner.java#L154
> This currently converts it to open write-ids to clean appropriately.

--
This message was sent by Atlassian Jira
(v8.3.4#803005)