----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/25394/ -----------------------------------------------------------
(Updated Sept. 20, 2014, 1:33 a.m.) Review request for hive, Brock Noland and Xuefu Zhang. Bugs: HIVE-7503 https://issues.apache.org/jira/browse/HIVE-7503 Repository: hive-git Description ------- For Hive's multi insert query (https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DML), there may be an MR job for each insert. When we achieve this with Spark, it would be nice if all the inserts can happen concurrently. It seems that this functionality isn't available in Spark. To make things worse, the source of the insert may be re-computed unless it's staged. Even with this, the inserts will happen sequentially, making the performance suffer. This task is to find out what takes in Spark to enable this without requiring staging the source and sequential insertion. If this has to be solved in Hive, find out an optimum way to do this. Diffs (updated) ----- ql/src/java/org/apache/hadoop/hive/ql/parse/spark/GenSparkProcContext.java 4211a07 ql/src/java/org/apache/hadoop/hive/ql/parse/spark/GenSparkUtils.java 695d8b9 ql/src/java/org/apache/hadoop/hive/ql/parse/spark/GenSparkWork.java 864965e ql/src/java/org/apache/hadoop/hive/ql/parse/spark/SparkCompiler.java 76fc290 ql/src/java/org/apache/hadoop/hive/ql/parse/spark/SparkMergeTaskProcessor.java PRE-CREATION ql/src/java/org/apache/hadoop/hive/ql/parse/spark/SparkMultiInsertionProcessor.java PRE-CREATION ql/src/java/org/apache/hadoop/hive/ql/parse/spark/SparkProcessAnalyzeTable.java 5fcaf64 ql/src/java/org/apache/hadoop/hive/ql/parse/spark/SparkTableScanProcessor.java PRE-CREATION ql/src/test/results/clientpositive/spark/insert1.q.out 49fb1d4 ql/src/test/results/clientpositive/spark/union18.q.out 9a40807 ql/src/test/results/clientpositive/spark/union19.q.out 131591f ql/src/test/results/clientpositive/spark/union_remove_6.q.out 1bc55f4 Diff: https://reviews.apache.org/r/25394/diff/ Testing ------- Thanks, Chao Sun