[jira] [Commented] (HIVE-5245) hive create table as select(CTAS) can not work(not support) with join on operator

jeff little (JIRA) Fri, 11 Oct 2013 23:45:08 -0700

    [ 
https://issues.apache.org/jira/browse/HIVE-5245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13793280#comment-13793280
 ]


jeff little commented on HIVE-5245:
-----------------------------------

Hi Yin Huai. Regarding "Can you try the trunk?": what is the next step?
I suspect that the intermediate results of the join operator are not saved,
that is, not written to the temporary HDFS directory. In other words, the job
may be failing, as suggested by the message 'Stage-7 is filtered out by
condition resolver'.
Another problem that we encountered recently is shown below:
hive (test)> select a.* from test_01 a
           > join (select b.id from test_02 b
           > join test_03 c
           > on (b.id =c.id)) d
           > on (a.id=d.id);
Total MapReduce jobs = 4
setting HADOOP_USER_NAME        hadoop
Execution log at: /tmp/hadoop/.log
2013-10-12 02:36:42     Starting to launch local task to process map join;      maximum memory = 932118528
2013-10-12 02:36:43     Processing rows:        4       Hashtable size: 4       Memory usage:   110930744       rate:   0.119
2013-10-12 02:36:43     Dump the hashtable into file: file:/tmp/hadoop/hive_2013-10-12_14-36-40_657_1301190087196742169/-local-10011/HashTable-Stage-9/MapJoin-mapfile41--.hashtable
2013-10-12 02:36:43     Upload 1 File to: file:/tmp/hadoop/hive_2013-10-12_14-36-40_657_1301190087196742169/-local-10011/HashTable-Stage-9/MapJoin-mapfile41--.hashtable File size: 444
2013-10-12 02:36:43     End of local task; Time Taken: 0.413 sec.
Execution completed successfully
Mapred Local Task Succeeded . Convert the Join into MapJoin
Mapred Local Task Succeeded . Convert the Join into MapJoin
Launching Job 1 out of 4
Number of reduce tasks is set to 0 since there's no reduce operator
Starting Job = job_201308241420_4028, Tracking URL = http://namenode:50030/jobdetails.jsp?jobid=job_201308241420_4028
Kill Command = /home/hadoop/package/hadoop-1.0.4/libexec/../bin/hadoop job  -kill job_201308241420_4028
Hadoop job information for Stage-9: number of mappers: 2; number of reducers: 0
2013-10-12 14:36:58,185 Stage-9 map = 0%,  reduce = 0%
2013-10-12 14:37:04,207 Stage-9 map = 100%,  reduce = 0%, Cumulative CPU 2.66 sec
2013-10-12 14:37:05,213 Stage-9 map = 100%,  reduce = 0%, Cumulative CPU 2.66 sec
2013-10-12 14:37:06,218 Stage-9 map = 100%,  reduce = 0%, Cumulative CPU 2.66 sec
2013-10-12 14:37:07,223 Stage-9 map = 100%,  reduce = 0%, Cumulative CPU 2.66 sec
2013-10-12 14:37:08,228 Stage-9 map = 100%,  reduce = 0%, Cumulative CPU 2.66 sec
2013-10-12 14:37:09,232 Stage-9 map = 100%,  reduce = 0%, Cumulative CPU 2.66 sec
2013-10-12 14:37:10,237 Stage-9 map = 100%,  reduce = 100%, Cumulative CPU 2.66 sec
MapReduce Total cumulative CPU time: 2 seconds 660 msec
Ended Job = job_201308241420_4028
Stage-12 is filtered out by condition resolver.
MapReduce Jobs Launched:
Job 0: Map: 2   Cumulative CPU: 2.66 sec   HDFS Read: 822 HDFS Write: 2190 SUCCESS
Total MapReduce CPU Time Spent: 2 seconds 660 msec
OK
Time taken: 29.662 seconds
hive (test)>

Note: the tables test_01, test_02 and test_03 all contain data and share the
same id values, yet the query returns nothing. This problem may also be
related to "Stage-12 is filtered out by condition resolver".
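
One possible way to narrow this down, offered only as a sketch: Hive has a
property, hive.auto.convert.join, that controls automatic conversion of joins
into map joins, and EXPLAIN prints the stage plan of a query. Turning the
conversion off for the session and rerunning the same query would show whether
the empty result depends on the map-join path. That the conversion is involved
is an assumption, not something the logs above confirm, and this has not been
run against these tables.

```sql
-- Assumption: the empty result is tied to the automatic map-join conversion.
-- Disable it for this session only.
set hive.auto.convert.join=false;

-- Print the stage plan to see which stages remain without the conversion.
explain
select a.* from test_01 a
join (select b.id from test_02 b
      join test_03 c
      on (b.id = c.id)) d
on (a.id = d.id);

-- Rerun the query from the session above and compare the row count.
select a.* from test_01 a
join (select b.id from test_02 b
      join test_03 c
      on (b.id = c.id)) d
on (a.id = d.id);
```

If rows come back with the conversion disabled, that would point at the
map-join path rather than at the data in the three tables.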

> hive create table as select(CTAS) can not work(not support) with join on 
> operator
> ---------------------------------------------------------------------------------
>
>                 Key: HIVE-5245
>                 URL: https://issues.apache.org/jira/browse/HIVE-5245
>             Project: Hive
>          Issue Type: Bug
>          Components: HiveServer2
>    Affects Versions: 0.11.0
>            Reporter: jeff little
>              Labels: CTAS, hive
>   Original Estimate: 96h
>  Remaining Estimate: 96h
>
> Hello everyone, I recently came across the following Hive problem:
> hive (test)> create table test_09 as
>            > select a.* from test_01 a
>            > join test_02 b
>            > on (a.id=b.id);
> Automatically selecting local only mode for query
> Total MapReduce jobs = 2
> setting HADOOP_USER_NAME        hadoop
> 13/09/09 17:22:36 WARN conf.Configuration: file:/tmp/hadoop/hive_2013-09-09_17-22-34_848_1629553341892012305/-local-10008/jobconf.xml:a attempt to override final parameter: mapred.system.dir;  Ignoring.
> 13/09/09 17:22:36 WARN conf.Configuration: file:/tmp/hadoop/hive_2013-09-09_17-22-34_848_1629553341892012305/-local-10008/jobconf.xml:a attempt to override final parameter: mapred.local.dir;  Ignoring.
> Execution log at: /tmp/hadoop/.log
> 2013-09-09 05:22:36     Starting to launch local task to process map join;      maximum memory = 932118528
> 2013-09-09 05:22:37     Processing rows:        4       Hashtable size: 4       Memory usage:   113068056       rate:   0.121
> 2013-09-09 05:22:37     Dump the hashtable into file: file:/tmp/hadoop/hive_2013-09-09_17-22-34_848_1629553341892012305/-local-10005/HashTable-Stage-6/MapJoin-mapfile90--.hashtable
> 2013-09-09 05:22:37     Upload 1 File to: file:/tmp/hadoop/hive_2013-09-09_17-22-34_848_1629553341892012305/-local-10005/HashTable-Stage-6/MapJoin-mapfile90--.hashtable File size: 788
> 2013-09-09 05:22:37     End of local task; Time Taken: 0.444 sec.
> Execution completed successfully
> Mapred Local Task Succeeded . Convert the Join into MapJoin
> Mapred Local Task Succeeded . Convert the Join into MapJoin
> Launching Job 1 out of 2
> Number of reduce tasks is set to 0 since there's no reduce operator
> 13/09/09 17:22:38 WARN conf.Configuration: file:/tmp/hadoop/hive_2013-09-09_17-22-34_848_1629553341892012305/-local-10009/jobconf.xml:a attempt to override final parameter: mapred.system.dir;  Ignoring.
> 13/09/09 17:22:38 WARN conf.Configuration: file:/tmp/hadoop/hive_2013-09-09_17-22-34_848_1629553341892012305/-local-10009/jobconf.xml:a attempt to override final parameter: mapred.local.dir;  Ignoring.
> Execution log at: /tmp/hadoop/.log
> Job running in-process (local Hadoop)
> Hadoop job information for null: number of mappers: 0; number of reducers: 0
> 2013-09-09 17:22:41,807 null map = 0%,  reduce = 0%
> 2013-09-09 17:22:44,814 null map = 100%,  reduce = 0%
> Ended Job = job_local_0001
> Execution completed successfully
> Mapred Local Task Succeeded . Convert the Join into MapJoin
> Stage-7 is filtered out by condition resolver.
> OK
> Time taken: 13.138 seconds
> hive (test)> select * from test_09;
> FAILED: SemanticException [Error 10001]: Line 1:14 Table not found 'test_09'
> hive (test)>
> Problem:
> I can't find the created table. The CTAS statement does not take effect, and
> the table is not created by this HQL statement at all. Can anyone explain
> why? Thanks.



--
This message was sent by Atlassian JIRA
(v6.1#6144)
