[ https://issues.apache.org/jira/browse/HIVE-8331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157005#comment-14157005 ]
Chao commented on HIVE-8331: ---------------------------- Discussed with [~szehon] offline - we think for {{parallel_join0.q}} and {{parallel_join1.q}}, it's just a ordering problem, and I think it's OK for MR and Spark to have different result order. {{vectorized_shufflejoin.q}} and {{union22.q}} have same results but different plans as MR. I think they are good. For {{union_remove_18.q}}, the unit test result is different from MR, and looks wrong to me. Interestingly, when I run the query in CLI spark mode, the result is the same as MR. I need to debug this. > HIVE-8303 followup, investigate result diff [Spark Branch] > ---------------------------------------------------------- > > Key: HIVE-8331 > URL: https://issues.apache.org/jira/browse/HIVE-8331 > Project: Hive > Issue Type: Task > Components: Spark > Reporter: Xuefu Zhang > Assignee: Chao > > HIVE-8303 patch introduced some result diffs for some spark tests. We need to > investigate those, including parallel_join0.q, union22.q, > vectorized_shufflejoin.q, union_remove_18.q, and maybe more. > Also the investigation includes the test failures related to spark. > Specifically, union_remove_18.q demonstrated random order. -- This message was sent by Atlassian JIRA (v6.3.4#6332)