[ https://issues.apache.org/jira/browse/HIVE-21164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17060447#comment-17060447 ]
Sungwoo commented on HIVE-21164: -------------------------------- [~kuczoram] In my testing, both select queries return non-empty lists, but the final ORC table is empty (which implies that Tez is okay while our execution engine has a bug): {code:sql} select ss.ss_sold_time_sk, ... ss.ss_net_profit, ss.ss_sold_date_sk where ss.ss_sold_date_sk is not null {code} {code:sql} select ss.ss_sold_time_sk, ... ss.ss_net_profit, ss.ss_sold_date_sk where ss.ss_sold_date_sk is null sort by ss.ss_sold_date_sk {code} If we use another second select query returning an empty list, the final ORC table is still empty, but this observation is not useful here because even when the second select query returns a non-empty list, the final ORC table is empty anyway. Let me try to set up an environment for testing Hive 4 on Tez (hopefully by the end of this week) and report the result. > ACID: explore how we can avoid a move step during inserts/compaction > -------------------------------------------------------------------- > > Key: HIVE-21164 > URL: https://issues.apache.org/jira/browse/HIVE-21164 > Project: Hive > Issue Type: Bug > Components: Transactions > Affects Versions: 3.1.1 > Reporter: Vaibhav Gumashta > Assignee: Marta Kuczora > Priority: Major > Fix For: 4.0.0 > > Attachments: HIVE-21164.1.patch, HIVE-21164.10.patch, > HIVE-21164.11.patch, HIVE-21164.11.patch, HIVE-21164.12.patch, > HIVE-21164.13.patch, HIVE-21164.14.patch, HIVE-21164.14.patch, > HIVE-21164.15.patch, HIVE-21164.16.patch, HIVE-21164.17.patch, > HIVE-21164.18.patch, HIVE-21164.19.patch, HIVE-21164.2.patch, > HIVE-21164.20.patch, HIVE-21164.21.patch, HIVE-21164.22.patch, > HIVE-21164.3.patch, HIVE-21164.4.patch, HIVE-21164.5.patch, > HIVE-21164.6.patch, HIVE-21164.7.patch, HIVE-21164.8.patch, HIVE-21164.9.patch > > > Currently, we write compacted data to a temporary location and then move the > files to a final location, which is an expensive operation on some cloud file > systems. Since HIVE-20823 is already in, it can control the visibility of > compacted data for the readers. Therefore, we can perhaps avoid writing data > to a temporary location and directly write compacted data to the intended > final path. -- This message was sent by Atlassian Jira (v8.3.4#803005)