[jira] [Updated] (HIVE-24318) When GlobalLimit is efficient, query will run twice with "Retry query with a different approach..."

libo (Jira) Fri, 30 Oct 2020 03:16:31 -0700


     [ 
https://issues.apache.org/jira/browse/HIVE-24318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


libo updated HIVE-24318:
------------------------
    Attachment:     (was: HIVE-24318.patch)

> When GlobalLimit is efficient, query will run twice with "Retry query with a 
> different approach..."
> ---------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-24318
>                 URL: https://issues.apache.org/jira/browse/HIVE-24318
>             Project: Hive
>          Issue Type: Bug
>          Components: Hive
>    Affects Versions: 2.0.1
>         Environment: Hadoop 2.6.0
> Hive-2.0.1
>            Reporter: libo
>            Assignee: libo
>            Priority: Minor
>         Attachments: HIVE-24318.patch
>
>
> hive.limit.optimize.enable=true
> hive.limit.row.max.size=10000000
> hive.limit.optimize.fetch.max=1000
> hive.fetch.task.conversion.threshold=25600000000
> hive.fetch.task.conversion=more
>  
> *sql eg:*
> select db_name,concat(tb_name,'test')  tbname from (select * from test1.t3 
> where dt='0909' limit 10)t1;
> (only partitioned table has this problem)
> *console information:*
> *......*
> Kill Command = /appcom/hadoop/bin/hadoop job -kill job_1600683831691_837491
> Hadoop job information for Stage-1: number of {color:#FF0000}mappers: 
> 1{color}; number of reducers: 1
>  map = 0%, reduce = 0%
>  map = 100%, reduce = 0%, Cumulative CPU 6.33 sec
>  map = 100%, reduce = 100%, Cumulative CPU 13.69 sec
> MapReduce Total cumulative CPU time: 13 seconds 690 msec
> Ended Job = job_1600683831691_837491
> MapReduce Jobs Launched:
> Stage-Stage-1: Map: 1 Reduce: 1 Cumulative CPU: 13.69 sec HDFS Read: 43833339 
> HDFS Write: 4115 SUCCESS
> Total MapReduce CPU Time Spent: 13 seconds 690 msec
> OK
> db_name      tbname 
> ......
> Retry query with a different approach...
> ......
>  Kill Command = /appcom/hadoop/bin/hadoop job -kill job_1600683831691_837520
> Hadoop job information for Stage-1: number of {color:#FF0000}mappers: 
> 176{color}; number of reducers: 1
> ......
> as we can see, the mr run twice，first time the global limit is efficient and 
> the second time is not
> *exception stack:*
> org.apache.hadoop.hive.ql.CommandNeedRetryException
>  at org.apache.hadoop.hive.ql.exec.FetchTask.fetch(FetchTask.java:147)
>  at org.apache.hadoop.hive.ql.Driver.getResults(Driver.java:2022)
>  at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:317)
>  at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:232)
>  at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:475)
>  at org.apache.hadoop.hive.cli.CliDriver.executeDriver(CliDriver.java:855)
>  at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:794)
>  at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:721)
>  at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>  at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>  at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>  at java.lang.reflect.Method.invoke(Method.java:498)
>  at org.apache.hadoop.util.RunJar.run(RunJar.java:323)
>  at org.apache.hadoop.util.RunJar.main(RunJar.java:236)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Updated] (HIVE-24318) When GlobalLimit is efficient, query will run twice with "Retry query with a different approach..."

Reply via email to