[ https://issues.apache.org/jira/browse/HIVE-20262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16560912#comment-16560912 ]
Hive QA commented on HIVE-20262: -------------------------------- Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12933488/HIVE-20262.patch {color:green}SUCCESS:{color} +1 due to 1 test(s) being added or modified. {color:red}ERROR:{color} -1 due to 20 failed/errored test(s), 14815 tests executed *Failed tests:* {noformat} org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[allcolref_in_udf] (batchId=55) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[except_all] (batchId=49) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[lateral_view_cp] (batchId=91) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[lateral_view_outer] (batchId=45) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[lateral_view_ppd] (batchId=93) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[limit_pushdown_negative] (batchId=43) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[nonmr_fetch] (batchId=21) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[ppd_udtf] (batchId=66) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[skewjoin_mapjoin6] (batchId=63) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[skewjoinopt10] (batchId=21) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[tablevalues] (batchId=72) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[udtf_json_tuple] (batchId=84) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[udtf_parse_url_tuple] (batchId=78) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[union26] (batchId=70) org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[union_lateralview] (batchId=26) org.apache.hadoop.hive.cli.TestMiniLlapCliDriver.testCliDriver[unionDistinct_1] (batchId=152) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[multi_insert_lateral_view] (batchId=132) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[skewjoinopt10] (batchId=117) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[union26] (batchId=138) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[union_lateralview] (batchId=120) {noformat} Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/12924/testReport Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/12924/console Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-12924/ Messages: {noformat} Executing org.apache.hive.ptest.execution.TestCheckPhase Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.YetusPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 20 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12933488 - PreCommit-HIVE-Build > Implement stats annotation rule for the UDTFOperator > ---------------------------------------------------- > > Key: HIVE-20262 > URL: https://issues.apache.org/jira/browse/HIVE-20262 > Project: Hive > Issue Type: Improvement > Components: Physical Optimizer > Reporter: George Pachitariu > Assignee: George Pachitariu > Priority: Minor > Attachments: HIVE-20262.patch > > > User Defined Table Functions (UDTFs) change the number of rows of the output. > A common UDTF is the explode() method that creates a row for each element for > each array in the input column. > > Right now, the number of output rows is equal to the number of input rows. > But if the average number of output rows is bigger than 1, the resulting > number of rows is underestimated in the execution plan. > > Implement a rule that can have a factor X as a parameter and for each UDTF > function predict that: > > {code:java} > number of output rows = X * number of input rows{code} > > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)