[
https://issues.apache.org/jira/browse/HIVE-28935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shohei Okumiya updated HIVE-28935:
----------------------------------
Description:
I observed that some records in the current snapshot were replicated after
running a minor compaction. I reproduced the issue when I combined a minor
compaction, bucket transform, and partition evolutions. I've not identified
which factor caused the issue.
This is a reproduction. The result set of `SELECT * FROM default.srcbucket_big
ORDER BY id` changes after the compaction.
[https://github.com/okumin/hive/commit/3949afb1d4b96714571123c5087e89ff078200cc#r156028049]
was:
I observed that some records were replicated after running a minor compaction.
I reproduced the issue when I combined a minor compaction, bucket transform,
and partition evolutions. I've not identified which factor caused the issue.
https://github.com/okumin/hive/commit/3949afb1d4b96714571123c5087e89ff078200cc#r156028049
> Iceberg's Minor Compaction replicates records
> ---------------------------------------------
>
> Key: HIVE-28935
> URL: https://issues.apache.org/jira/browse/HIVE-28935
> Project: Hive
> Issue Type: Bug
> Components: Iceberg integration
> Reporter: Shohei Okumiya
> Priority: Critical
>
> I observed that some records in the current snapshot were replicated after
> running a minor compaction. I reproduced the issue when I combined a minor
> compaction, bucket transform, and partition evolutions. I've not identified
> which factor caused the issue.
> This is a reproduction. The result set of `SELECT * FROM
> default.srcbucket_big ORDER BY id` changes after the compaction.
> [https://github.com/okumin/hive/commit/3949afb1d4b96714571123c5087e89ff078200cc#r156028049]
--
This message was sent by Atlassian Jira
(v8.20.10#820010)