Zhihua Deng created HIVE-28700: ---------------------------------- Summary: MRCompactor may cause data loss when performing the major compaction Key: HIVE-28700 URL: https://issues.apache.org/jira/browse/HIVE-28700 Project: Hive Issue Type: Bug Reporter: Zhihua Deng Assignee: Zhihua Deng
Steps to repro: set mapreduce.job.reduces=7; create table ext(a int); insert into table ext values(1),(2),(3),(3),(3),(3),(4),(5),(6),(7); create table full_acid(a int) stored as orc tblproperties("transactional"="true"); insert overwrite table full_acid select * from ext where a = 3; insert into table full_acid select * from ext where a != 3 group by a; select * from full_acid; alter table full_acid compact 'major' and wait; select * from full_acid; After the major compaction, the full_acid table misses records with a = 3; -- This message was sent by Atlassian Jira (v8.20.10#820010)