[ https://issues.apache.org/jira/browse/HIVE-12084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14952205#comment-14952205 ]
Hive QA commented on HIVE-12084: -------------------------------- {color:red}Overall{color}: -1 at least one tests failed Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12765940/HIVE-12084.1.patch {color:green}SUCCESS:{color} +1 due to 1 test(s) being added or modified. {color:red}ERROR:{color} -1 due to 5 failed/errored test(s), 9657 tests executed *Failed tests:* {noformat} TestSparkNegativeCliDriver - did not produce a TEST-*.xml file org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_explain_rearrange org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_udaf_histogram_numeric org.apache.hive.hcatalog.api.TestHCatClient.testTableSchemaPropagation org.apache.hive.jdbc.TestSSL.testSSLVersion {noformat} Test results: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/5606/testReport Console output: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/5606/console Test logs: http://ec2-174-129-184-35.compute-1.amazonaws.com/logs/PreCommit-HIVE-TRUNK-Build-5606/ Messages: {noformat} Executing org.apache.hive.ptest.execution.TestCheckPhase Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 5 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12765940 - PreCommit-HIVE-TRUNK-Build > Hive queries with ORDER BY and large LIMIT fails with OutOfMemoryError Java > heap space > -------------------------------------------------------------------------------------- > > Key: HIVE-12084 > URL: https://issues.apache.org/jira/browse/HIVE-12084 > Project: Hive > Issue Type: Bug > Reporter: Hari Sankar Sivarama Subramaniyan > Assignee: Hari Sankar Sivarama Subramaniyan > Attachments: HIVE-12084.1.patch > > > STEPS TO REPRODUCE: > {code} > CREATE TABLE `sample_07` ( `code` string , `description` string , `total_emp` > int , `salary` int ) ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t' STORED AS > TextFile; > load data local inpath 'sample_07.csv' into table sample_07; > set hive.limit.pushdown.memory.usage=0.9999; > select * from sample_07 order by salary LIMIT 999999999; > {code} > This will result in > {code} > Caused by: java.lang.OutOfMemoryError: Java heap space > at org.apache.hadoop.hive.ql.exec.TopNHash.initialize(TopNHash.java:113) > at > org.apache.hadoop.hive.ql.exec.ReduceSinkOperator.initializeOp(ReduceSinkOperator.java:234) > at > org.apache.hadoop.hive.ql.exec.vector.VectorReduceSinkOperator.initializeOp(VectorReduceSinkOperator.java:68) > at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:385) > at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:469) > at > org.apache.hadoop.hive.ql.exec.Operator.initializeChildren(Operator.java:425) > {code} > The basic issue lies with top n optimization. We need a limit for the top n > optimization. Ideally we would detect that the allocated bytes will be bigger > than the "limit.pushdown.memory.usage" without trying to alloc it. -- This message was sent by Atlassian JIRA (v6.3.4#6332)