Zhihua Deng created HIVE-27494:
----------------------------------

             Summary: Deduplicate the task result that generated by more 
branches in union all
                 Key: HIVE-27494
                 URL: https://issues.apache.org/jira/browse/HIVE-27494
             Project: Hive
          Issue Type: Bug
            Reporter: Zhihua Deng
            Assignee: Zhihua Deng


HIVE-23891 adds the ability to deduplicate the task result that under the 
directory,

<table-dir>/<staging-dir>/_tmp.-ext-10000/<dynamic-partition-dir>/HIVE_UNION_SUBDIR_1,

but turns out to ignore taking the same action to the directory for the same 
query:

<table-dir>/<staging-dir>/_tmp.-ext-10000/<dynamic-partition-dir>/HIVE_UNION_SUBDIR_2.

So user may still have the same data duplication problem in multiple tez task 
attempts.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to