[ https://issues.apache.org/jira/browse/HIVE-15539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15798710#comment-15798710 ]
Hive QA commented on HIVE-15539: -------------------------------- Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12845559/HIVE-15539.patch {color:green}SUCCESS:{color} +1 due to 2 test(s) being added or modified. {color:red}ERROR:{color} -1 due to 91 failed/errored test(s), 10915 tests executed *Failed tests:* {noformat} TestDerbyConnector - did not produce a TEST-*.xml file (likely timed out) (batchId=233) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[allcolref_in_udf] (batchId=48) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[auto_sortmerge_join_13] (batchId=59) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[autogen_colalias] (batchId=25) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[case_sensitivity] (batchId=61) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[ctas_colname] (batchId=53) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[ctas_date] (batchId=1) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[groupby_sort_1_23] (batchId=71) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[groupby_sort_skew_1_23] (batchId=8) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[input_testxpath] (batchId=28) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[masking_8] (batchId=7) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[materialized_view_create] (batchId=65) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_ctas] (batchId=61) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[parquet_join] (batchId=18) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[partition_date2] (batchId=16) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[partition_timestamp2] (batchId=29) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[ppd_transform] (batchId=69) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[print_header] (batchId=7) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[timestamp_udf] (batchId=71) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[udf_coalesce] (batchId=75) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[union17] (batchId=66) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[union18] (batchId=16) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[union19] (batchId=55) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[union31] (batchId=3) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[union34] (batchId=11) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[union_remove_6] (batchId=40) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[union_remove_6_subq] (batchId=35) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[vector_complex_all] (batchId=55) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[vector_complex_join] (batchId=40) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[vector_interval_arithmetic] (batchId=4) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[vector_tablesample_rows] (batchId=47) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[vectorized_date_funcs] (batchId=69) org.apache.hadoop.hive.cli.TestMiniLlapCliDriver.testCliDriver[explainuser_2] (batchId=136) org.apache.hadoop.hive.cli.TestMiniLlapCliDriver.testCliDriver[orc_ppd_basic] (batchId=134) org.apache.hadoop.hive.cli.TestMiniLlapCliDriver.testCliDriver[unionDistinct_1] (batchId=135) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[auto_smb_mapjoin_14] (batchId=149) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[auto_sortmerge_join_13] (batchId=149) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[lineage2] (batchId=148) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[lineage3] (batchId=146) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[offset_limit_ppd_optimizer] (batchId=150) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[schema_evol_orc_acid_part_update] (batchId=150) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[schema_evol_orc_acid_table] (batchId=144) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[schema_evol_orc_acid_table_update] (batchId=150) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[schema_evol_orc_acidvec_part_update] (batchId=139) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[schema_evol_orc_acidvec_table] (batchId=153) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[schema_evol_orc_acidvec_table_update] (batchId=138) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[schema_evol_orc_nonvec_table] (batchId=139) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[schema_evol_orc_vec_table] (batchId=145) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[schema_evol_text_nonvec_table] (batchId=140) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[schema_evol_text_vec_table] (batchId=146) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[schema_evol_text_vecrow_table] (batchId=139) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[tez_union_multiinsert] (batchId=144) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[vector_auto_smb_mapjoin_14] (batchId=143) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[vector_complex_all] (batchId=149) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[vector_complex_join] (batchId=146) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[vector_interval_arithmetic] (batchId=138) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[vector_number_compare_projection] (batchId=139) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[vectorization_0] (batchId=153) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[vectorized_date_funcs] (batchId=151) org.apache.hadoop.hive.cli.TestMiniTezCliDriver.testCliDriver[explainanalyze_2] (batchId=93) org.apache.hadoop.hive.cli.TestNegativeCliDriver.testCliDriver[ctasnullcol] (batchId=85) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[auto_smb_mapjoin_14] (batchId=119) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[auto_sortmerge_join_13] (batchId=121) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[bucket_map_join_spark2] (batchId=95) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[bucketmapjoin2] (batchId=95) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[cbo_udf_udaf] (batchId=95) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[groupby_cube1] (batchId=95) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[groupby_sort_1_23] (batchId=128) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[groupby_sort_skew_1_23] (batchId=98) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[join13] (batchId=95) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[join_vc] (batchId=95) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[parquet_join] (batchId=102) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[ppd_transform] (batchId=127) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[ptf_decimal] (batchId=95) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[sample3] (batchId=95) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[smb_mapjoin_19] (batchId=95) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[timestamp_udf] (batchId=128) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[union17] (batchId=124) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[union18] (batchId=101) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[union19] (batchId=119) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[union23] (batchId=95) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[union31] (batchId=95) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[union34] (batchId=99) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[union] (batchId=95) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[union_remove_6] (batchId=113) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[union_remove_6_subq] (batchId=110) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[union_remove_7] (batchId=95) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[vectorization_0] (batchId=129) org.apache.hive.jdbc.TestJdbcDriver2.testBuiltInUDFCol (batchId=213) org.apache.hive.jdbc.TestJdbcDriver2.testResultSetMetaData (batchId=213) org.apache.hive.jdbc.TestJdbcDriver2.testResultSetMetaDataDuplicateColumnNames (batchId=213) {noformat} Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/2778/testReport Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/2778/console Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-2778/ Messages: {noformat} Executing org.apache.hive.ptest.execution.TestCheckPhase Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 91 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12845559 - PreCommit-HIVE-Build > Optimize complex multi-insert queries in Calcite > ------------------------------------------------ > > Key: HIVE-15539 > URL: https://issues.apache.org/jira/browse/HIVE-15539 > Project: Hive > Issue Type: Improvement > Components: Parser > Affects Versions: 2.2.0 > Reporter: Jesus Camacho Rodriguez > Assignee: Jesus Camacho Rodriguez > Attachments: HIVE-15539.patch > > > Currently multi-insert queries are not optimized by Calcite. Proper > integration with Calcite would include creating a _spool_ operator whose > output is reused by every _insert_ statement; however, _spool_ operator has > not been added to Calcite yet (CALCITE-481). > In the meantime, and since complex logic for multi-insert queries is in FROM > clause, we can optimize the FROM clause with Calcite and connect the > optimized result to the original query. > Initially, we will recognize three different cases: > - FROM clause is trivial, e.g., table reference, or not supported. No need to > optimize with Calcite. > - FROM clause is a subquery. Optimize the subquery with Calcite. > - FROM clause is a join. Rewrite join into a subquery and optimize it with > Calcite. Change references in INSERT statements to refer to subquery columns. > This should be beneficial for MERGE statements processing too, since MERGE > statements are treated as multi-insert queries by Hive. -- This message was sent by Atlassian JIRA (v6.3.4#6332)