[ 
https://issues.apache.org/jira/browse/HIVE-28935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shohei Okumiya updated HIVE-28935:
----------------------------------
    Description: 
I observed that some records in the current snapshot were duplicated after 
running a minor compaction. I reproduced the issue with a combination of minor 
compaction, a bucket transform, and partition evolution. I have not yet 
identified which of these factors causes it.

The commit linked below reproduces the issue. The result set of `SELECT * FROM 
default.srcbucket_big ORDER BY id` changes after the compaction.

[https://github.com/okumin/hive/commit/3949afb1d4b96714571123c5087e89ff078200cc#r156028049]

  was:
I observed that some records were replicated after running a minor compaction. 
I reproduced the issue when I combined a minor compaction, bucket transform, 
and partition evolutions. I've not identified which factor caused the issue.

https://github.com/okumin/hive/commit/3949afb1d4b96714571123c5087e89ff078200cc#r156028049


> Iceberg's Minor Compaction replicates records
> ---------------------------------------------
>
>                 Key: HIVE-28935
>                 URL: https://issues.apache.org/jira/browse/HIVE-28935
>             Project: Hive
>          Issue Type: Bug
>          Components: Iceberg integration
>            Reporter: Shohei Okumiya
>            Priority: Critical
>
> I observed that some records in the current snapshot were duplicated after 
> running a minor compaction. I reproduced the issue with a combination of minor 
> compaction, a bucket transform, and partition evolution. I have not yet 
> identified which of these factors causes it.
> The commit linked below reproduces the issue. The result set of `SELECT * FROM 
> default.srcbucket_big ORDER BY id` changes after the compaction.
> [https://github.com/okumin/hive/commit/3949afb1d4b96714571123c5087e89ff078200cc#r156028049]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
