[ https://issues.apache.org/jira/browse/HUDI-8654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Kate Huber updated HUDI-8654: ----------------------------- Status: In Progress (was: Patch Available) > Support correct merging results with record positions in log blocks generated > during pending compaction > ------------------------------------------------------------------------------------------------------- > > Key: HUDI-8654 > URL: https://issues.apache.org/jira/browse/HUDI-8654 > Project: Apache Hudi > Issue Type: Sub-task > Reporter: Y Ethan Guo > Assignee: Y Ethan Guo > Priority: Blocker > Labels: pull-request-available > Fix For: 1.0.1 > > Original Estimate: 20h > Time Spent: 4h > Remaining Estimate: 16h > > When there is a pending compaction, the new base files to be generated by > compaction is not available during this transaction. Given the log files in > MOR from this transaction can be attached to the base file generated by the > compaction in the latest file slice, the accurate record positions may not be > derived. However, the log files written in later delta commits after > completed compaction have accurate positions. > Similarly, for NBCC, the compaction can be schedule during an inflight > deltacommit, and in this case the log file generated by the inflight > deltacommit is associated with the new base file from the compaction, which > may have different positions because of deletes. > We need to make sure that the file group reader with position-based merging > generates the correct results in such mix of log blocks. -- This message was sent by Atlassian Jira (v8.20.10#820010)