[ https://issues.apache.org/jira/browse/HIVE-16957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16721904#comment-16721904 ]
Hive QA commented on HIVE-16957:
--------------------------------

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12951854/HIVE-16957.03.patch

{color:green}SUCCESS:{color} +1 due to 29 test(s) being added or modified.

{color:green}SUCCESS:{color} +1 due to 15724 tests passed

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/15333/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/15333/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-15333/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.YetusPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12951854 - PreCommit-HIVE-Build

> Support CTAS for auto gather column stats
> -----------------------------------------
>
>                 Key: HIVE-16957
>                 URL: https://issues.apache.org/jira/browse/HIVE-16957
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Pengcheng Xiong
>            Assignee: Jesus Camacho Rodriguez
>            Priority: Major
>         Attachments: HIVE-16957.01.patch, HIVE-16957.02.patch,
> HIVE-16957.03.patch, HIVE-16957.patch
>
>
> The idea is to rely as much as possible on the logic in
> ColumnStatsSemanticAnalyzer, as other operations do. In particular, they
> create an 'analyze table t compute statistics for columns' statement, use
> ColumnStatsSemanticAnalyzer to parse it, and connect the resulting plan to
> the existing INSERT/INSERT OVERWRITE statement. The challenge for CTAS or
> CREATE MATERIALIZED VIEW is that the table object does not exist yet, hence
> we cannot rely fully on ColumnStatsSemanticAnalyzer.
> Thus, we use the same process, but ColumnStatsSemanticAnalyzer produces a
> statement for column stats collection that uses a table values clause
> instead of the original table reference:
> {code}
> select compute_stats(col1), compute_stats(col2), compute_stats(col3)
> from table(values(cast(null as int), cast(null as int), cast(null as string)))
>   as t(col1, col2, col3);
> {code}

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
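
For readers who want to see how the statement quoted in the description is assembled, here is a minimal sketch. It assumes only that the target table's column names and types are known when the CTAS is analyzed. The function name `build_ctas_column_stats_query` and its parameters are hypothetical and used for illustration only; the real logic lives in Hive's Java class ColumnStatsSemanticAnalyzer and may differ.

```python
from typing import List, Tuple


def build_ctas_column_stats_query(
    columns: List[Tuple[str, str]], alias: str = "t"
) -> str:
    """Build a column stats query over a typed VALUES clause.

    Hypothetical helper for illustration only; not Hive's actual code.
    `columns` is a list of (column_name, hive_type) pairs.
    """
    if not columns:
        raise ValueError("at least one column is required")

    # One compute_stats(...) call per target column.
    select_list = ", ".join(f"compute_stats({name})" for name, _ in columns)

    # A single row of typed NULLs replaces the reference to the table
    # that does not exist yet.
    typed_nulls = ", ".join(
        f"cast(null as {hive_type})" for _, hive_type in columns
    )

    column_names = ", ".join(name for name, _ in columns)

    return (
        f"select {select_list} "
        f"from table(values({typed_nulls})) as {alias}({column_names})"
    )


query = build_ctas_column_stats_query(
    [("col1", "int"), ("col2", "int"), ("col3", "string")]
)
print(query)
```

With the three columns from the description, the sketch reproduces the quoted statement without the trailing semicolon. The typed NULL row appears to serve only as a source of column names and types, so the statement can be parsed before the table object exists.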