[ https://issues.apache.org/jira/browse/HIVE-7870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14125726#comment-14125726 ]
Chao commented on HIVE-7870: ---------------------------- OK, I think I understand the code now (BTW, forgive me if I'm wrong, seems like you can refactor the code for re-constructing linkedfilesinkdesc by removing some common code). Also, just curious, if I just remove the line {{context.fileSinkSet.add(fileSink)}} in {{removeUnionOperators}}, will it generate the same result? > Insert overwrite table query does not generate correct task plan [Spark > Branch] > ------------------------------------------------------------------------------- > > Key: HIVE-7870 > URL: https://issues.apache.org/jira/browse/HIVE-7870 > Project: Hive > Issue Type: Sub-task > Components: Spark > Reporter: Na Yang > Assignee: Na Yang > Labels: Spark-M1 > Attachments: HIVE-7870.1-spark.patch, HIVE-7870.2-spark.patch, > HIVE-7870.3-spark.patch, HIVE-7870.4-spark.patch, HIVE-7870.5-spark.patch > > > Insert overwrite table query does not generate correct task plan when > hive.optimize.union.remove and hive.merge.sparkfiles properties are ON. > {noformat} > set hive.optimize.union.remove=true > set hive.merge.sparkfiles=true > insert overwrite table outputTbl1 > SELECT * FROM > ( > select key, 1 as values from inputTbl1 > union all > select * FROM ( > SELECT key, count(1) as values from inputTbl1 group by key > UNION ALL > SELECT key, 2 as values from inputTbl1 > ) a > )b; > select * from outputTbl1 order by key, values; > {noformat} > query result > {noformat} > 1 1 > 1 2 > 2 1 > 2 2 > 3 1 > 3 2 > 7 1 > 7 2 > 8 2 > 8 2 > 8 2 > {noformat} > expected result: > {noformat} > 1 1 > 1 1 > 1 2 > 2 1 > 2 1 > 2 2 > 3 1 > 3 1 > 3 2 > 7 1 > 7 1 > 7 2 > 8 1 > 8 1 > 8 2 > 8 2 > 8 2 > {noformat} > Move work is not working properly and some data are missing during move. -- This message was sent by Atlassian JIRA (v6.3.4#6332)