[ 
https://issues.apache.org/jira/browse/HIVE-21172?focusedWorklogId=695748&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-695748
 ]

ASF GitHub Bot logged work on HIVE-21172:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 14/Dec/21 13:16
            Start Date: 14/Dec/21 13:16
    Worklog Time Spent: 10m 
      Work Description: kasakrisz commented on a change in pull request #2857:
URL: https://github.com/apache/hive/pull/2857#discussion_r768652584



##########
File path: ql/src/test/results/clientpositive/llap/masking_acid_no_masking.q.out
##########
@@ -54,8 +53,9 @@ POSTHOOK: Input: default@masking_acid_no_masking
 POSTHOOK: Input: default@nonacid_n0
 POSTHOOK: Output: default@masking_acid_no_masking
 POSTHOOK: Output: default@masking_acid_no_masking
-POSTHOOK: Output: default@masking_acid_no_masking
 POSTHOOK: Output: default@merge_tmp_table
 POSTHOOK: Lineage: masking_acid_no_masking.key SIMPLE 
[(nonacid_n0)s.FieldSchema(name:key, type:int, comment:null), ]
+POSTHOOK: Lineage: masking_acid_no_masking.key SIMPLE 
[(nonacid_n0)s.FieldSchema(name:key, type:int, comment:null), ]

Review comment:
       These lineages are generated by the MoveTask when inserting.
   
   By turning on `hive.merge.split.update` update branch of merge statements 
are splitted into a insert and a delete branch.
   
   Originally this merge had only one insert branch but now it has two to the 
same table same columns:
   - one for the insert branch
   - one for the update branch
   




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 695748)
    Time Spent: 0.5h  (was: 20m)

> DEFAULT keyword handling in MERGE UPDATE clause issues
> ------------------------------------------------------
>
>                 Key: HIVE-21172
>                 URL: https://issues.apache.org/jira/browse/HIVE-21172
>             Project: Hive
>          Issue Type: Sub-task
>          Components: SQL, Transactions
>    Affects Versions: 4.0.0
>            Reporter: Eugene Koifman
>            Assignee: Krisztian Kasa
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> once HIVE-21159 lands, enable {{HiveConf.MERGE_SPLIT_UPDATE}} and run these 
> tests.
> TestMiniLlapLocalCliDriver.testCliDriver[sqlmerge_stats]
>  mvn test -Dtest=TestMiniLlapLocalCliDriver 
> -Dqfile=insert_into_default_keyword.q
> Merge is rewritten as a multi-insert. When Update clause has DEFAULT, it's 
> not properly replaced with a value in the muli-insert - it's treated as a 
> literal
> {noformat}
> INSERT INTO `default`.`acidTable`    -- update clause(insert part)
>  SELECT `t`.`key`, `DEFAULT`, `t`.`value`
>    WHERE `t`.`key` = `s`.`key` AND `s`.`key` > 3 AND NOT(`s`.`key` < 3)
> {noformat}
> See {{LOG.info("Going to reparse <" + originalQuery + "> as \n<" + 
> rewrittenQueryStr.toString() + ">");}} in hive.log
> {{MergeSemanticAnalyzer.replaceDefaultKeywordForMerge()}} is only called in 
> {{handleInsert}} but not {{handleUpdate()}}. Why does issue only show up with 
> {{MERGE_SPLIT_UPDATE}}?
> Once this is fixed, HiveConf.MERGE_SPLIT_UPDATE should be true by default



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to