[ https://issues.apache.org/jira/browse/HIVE-14265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15382422#comment-15382422 ]
Hive QA commented on HIVE-14265:
--------------------------------

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12818539/HIVE-14265.patch

{color:green}SUCCESS:{color} +1 due to 1 test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 7 failed/errored test(s), 10334 tests executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_acid_globallimit
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_stats_list_bucket
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_subquery_multiinsert
org.apache.hadoop.hive.cli.TestMiniTezCliDriver.testCliDriver_acid_globallimit
org.apache.hadoop.hive.llap.daemon.impl.TestLlapTokenChecker.testCheckPermissions
org.apache.hadoop.hive.llap.daemon.impl.TestLlapTokenChecker.testGetToken
org.apache.hadoop.hive.metastore.TestMetaStoreMetrics.testConnections
{noformat}

Test results: https://builds.apache.org/job/PreCommit-HIVE-MASTER-Build/565/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-MASTER-Build/565/console
Test logs: http://ec2-204-236-174-241.us-west-1.compute.amazonaws.com/logs/PreCommit-HIVE-MASTER-Build-565/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 7 tests failed
{noformat}

This message is automatically generated.
ATTACHMENT ID: 12818539 - PreCommit-HIVE-MASTER-Build

> Partial stats in Join operator may lead to data size estimate of 0
> ------------------------------------------------------------------
>
>                 Key: HIVE-14265
>                 URL: https://issues.apache.org/jira/browse/HIVE-14265
>             Project: Hive
>          Issue Type: Bug
>          Components: Statistics
>            Reporter: Nita Dembla
>            Assignee: Jesus Camacho Rodriguez
>         Attachments: HIVE-14265.patch
>
>
> For some tables, we might not have the column stats available. However, if
> the table is partitioned, we will have the stats for partition columns.
> When we estimate the size of the data produced by a join operator, we end up
> using only the columns that are available for the calculation e.g. partition
> columns in this case.
> However, even in these cases, we should add the data size for those columns
> for which we do not have stats (_default size for the column type x estimated
> number of rows_).
> To reproduce, the following example can be used:
> {noformat}
> create table sample_partitioned (x int) partitioned by (y int);
> insert into sample_partitioned partition(y=1) values (1),(2);
> create temporary table sample as select * from sample_partitioned;
> analyze table sample compute statistics for columns;
> explain select sample_partitioned.x from sample_partitioned, sample where
> sample.y = sample_partitioned.y;
> {noformat}

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
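
The fallback proposed in the issue description (default size for the column type times estimated number of rows) can be illustrated with a minimal Python sketch. The per-type default sizes, the function name, and the row count of 2 are assumptions made for illustration only; this is not Hive's actual implementation, nor the contents of HIVE-14265.patch.

```python
# Hypothetical per-type default sizes in bytes (illustrative values,
# not Hive's real constants).
DEFAULT_TYPE_SIZE = {"int": 4, "bigint": 8, "double": 8, "string": 100}


def estimate_data_size(num_rows, columns, col_stats, use_defaults=True):
    """Estimate the output data size of an operator in bytes.

    columns:   list of (name, type) pairs the operator outputs
    col_stats: dict mapping column name -> average column size in bytes,
               present only for columns that have statistics
    use_defaults: if False, columns without stats are skipped, which
               mimics the behaviour described in the issue
    """
    total = 0
    for name, col_type in columns:
        if name in col_stats:
            # Column statistics are available: use the recorded average size.
            total += col_stats[name] * num_rows
        elif use_defaults:
            # No statistics: fall back to default type size x estimated rows.
            total += DEFAULT_TYPE_SIZE[col_type] * num_rows
    return total


# The join in the reproduction outputs only sample_partitioned.x, which has
# no column stats; only the partition column y has them.
output_columns = [("x", "int")]
stats = {"y": 4}

print(estimate_data_size(2, output_columns, stats, use_defaults=False))  # 0
print(estimate_data_size(2, output_columns, stats))                      # 8
```

Without the fallback, the estimate collapses to 0 because no output column has statistics; with it, the estimate stays non-zero even when stats are partial.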