[ https://issues.apache.org/jira/browse/HIVE-7731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14097321#comment-14097321 ]
Thejas M Nair commented on HIVE-7731: ------------------------------------- bq. We've been placing "[Spark Branch]" in commit messages and I'll be more diligent about ensuring tasks are sub-tasks of HIVE-7292 and that we have "[Spark Branch]" in the title. Thanks Brock! That will really help avoid confusion in the auto-generated release notes. > Incorrect result returned when a map work has multiple downstream reduce > works [Spark Branch] > --------------------------------------------------------------------------------------------- > > Key: HIVE-7731 > URL: https://issues.apache.org/jira/browse/HIVE-7731 > Project: Hive > Issue Type: Sub-task > Components: Spark > Reporter: Rui Li > Assignee: Chao > > Encountered when running on spark. Suppose we have three tables: > {noformat} > table1(x int, y int); > table2(x int); > table3(x int); > {noformat} > I run the following query: > {noformat} > from table1 > insert overwrite table table2 select x group by x > insert overwrite table table3 select y group by y; > {noformat} > The query generates 1 map and 2 reduces. The map operator has 2 RS, so I > suppose it has output for both reduces. > The problem is all (incorrect) results go to table2 and table3 is empty. > I tried the same query on MR and it gives correct results. -- This message was sent by Atlassian JIRA (v6.2#6252)