Karel Kolman created HIVE-21798: ----------------------------------- Summary: OrcInputFormat#generateSplitsInfo returns 0 splits for a transactional delta-only table Key: HIVE-21798 URL: https://issues.apache.org/jira/browse/HIVE-21798 Project: Hive Issue Type: Bug Affects Versions: 3.1.0 Reporter: Karel Kolman
OrcInputFormat#generateSplitsInfo returns 0 splits for acid table with having no base, having deltas only. Table data (no base) {noformat} $ hadoop fs -ls /apps/hive/warehouse/test_table drwxr-xr-x - hive hdfs 0 2019-05-28 13:13 /apps/hive/warehouse/test_table/delta_0000001_0000001_0000 $ hadoop fs -ls -r /apps/hive/warehouse/test_table/delta_0000001_0000001_0000 Found 4 items -rw-r--r-- 3 hive hdfs 1302 2019-05-28 13:13 /apps/hive/warehouse/test_table/delta_0000001_0000001_0000/bucket_00002 -rw-r--r-- 3 hive hdfs 1250 2019-05-28 13:13 /apps/hive/warehouse/test_table/delta_0000001_0000001_0000/bucket_00001 -rw-r--r-- 3 hive hdfs 1215 2019-05-28 13:13 /apps/hive/warehouse/test_table/delta_0000001_0000001_0000/bucket_00000 -rw-r--r-- 3 hive hdfs 1 2019-05-28 13:13 /apps/hive/warehouse/test_table/delta_0000001_0000001_0000/_orc_acid_version{noformat} Table created with {noformat} CREATE TABLE test_table (userid bigint) clustered by (userid) into 3 buckets stored as orc TBLPROPERTIES ('transactional'='true'); INSERT INTO TABLE test_table SELECT * FROM source_table; {noformat} Is this expected behavior ? Change of behavior from earlier Hive versions seems to come from commit [https://github.com/apache/hive/commit/34b0e07a3bd5002b197b749e3e5a7992e196c237#diff-ede47c0c518f8577dd3895984f019330L1037] related to HIVE-17089. -- This message was sent by Atlassian JIRA (v7.6.3#76005)