[ https://issues.apache.org/jira/browse/HIVE-5245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13793280#comment-13793280 ]
jeff little commented on HIVE-5245: ----------------------------------- Hi, Yin Huai. What is going on next step "Can you try the trunk?" I deem that the middle join results of join operator may not be saved and not be written to the temp HDFS. In other words, it may be failure, like 'Stage-7 is filtered out by condition resolver'. The other problem that we encountered recently is like below: hive (test)> select a.* from test_01 a > join (select b.id from test_02 b > join test_03 c > on (b.id =c.id)) d > on (a.id=d.id); Total MapReduce jobs = 4 setting HADOOP_USER_NAME hadoop Execution log at: /tmp/hadoop/.log 2013-10-12 02:36:42 Starting to launch local task to process map join; maximum memory = 932118528 2013-10-12 02:36:43 Processing rows: 4 Hashtable size: 4 Memory usage: 110930744 rate: 0.119 2013-10-12 02:36:43 Dump the hashtable into file: file:/tmp/hadoop/hive_2013-10-12_14-36-40_657_1301190087196742169/-local-10011/HashTable-Stage-9/MapJoin-mapfile41--.hashtable 2013-10-12 02:36:43 Upload 1 File to: file:/tmp/hadoop/hive_2013-10-12_14-36-40_657_1301190087196742169/-local-10011/HashTable-Stage-9/MapJoin-mapfile41--.hashtable File size: 444 2013-10-12 02:36:43 End of local task; Time Taken: 0.413 sec. Execution completed successfully Mapred Local Task Succeeded . Convert the Join into MapJoin Mapred Local Task Succeeded . Convert the Join into MapJoin Launching Job 1 out of 4 Number of reduce tasks is set to 0 since there's no reduce operator Starting Job = job_201308241420_4028, Tracking URL = http://namenode:50030/jobdetails.jsp?jobid=job_201308241420_4028 Kill Command = /home/hadoop/package/hadoop-1.0.4/libexec/../bin/hadoop job -kill job_201308241420_4028 Hadoop job information for Stage-9: number of mappers: 2; number of reducers: 0 2013-10-12 14:36:58,185 Stage-9 map = 0%, reduce = 0% 2013-10-12 14:37:04,207 Stage-9 map = 100%, reduce = 0%, Cumulative CPU 2.66 sec 2013-10-12 14:37:05,213 Stage-9 map = 100%, reduce = 0%, Cumulative CPU 2.66 sec 2013-10-12 14:37:06,218 Stage-9 map = 100%, reduce = 0%, Cumulative CPU 2.66 sec 2013-10-12 14:37:07,223 Stage-9 map = 100%, reduce = 0%, Cumulative CPU 2.66 sec 2013-10-12 14:37:08,228 Stage-9 map = 100%, reduce = 0%, Cumulative CPU 2.66 sec 2013-10-12 14:37:09,232 Stage-9 map = 100%, reduce = 0%, Cumulative CPU 2.66 sec 2013-10-12 14:37:10,237 Stage-9 map = 100%, reduce = 100%, Cumulative CPU 2.66 sec MapReduce Total cumulative CPU time: 2 seconds 660 msec Ended Job = job_201308241420_4028 Stage-12 is filtered out by condition resolver. MapReduce Jobs Launched: Job 0: Map: 2 Cumulative CPU: 2.66 sec HDFS Read: 822 HDFS Write: 2190 SUCCESS Total MapReduce CPU Time Spent: 2 seconds 660 msec OK Time taken: 29.662 seconds hive (test)> Note: the table of test_01, test_02 and test_03 have data, and have the same values of id, but we can't get results. Inversely, it returns nothing. The problem may also be caused by “Stage-12 is filtered out by condition resolver”. > hive create table as select(CTAS) can not work(not support) with join on > operator > --------------------------------------------------------------------------------- > > Key: HIVE-5245 > URL: https://issues.apache.org/jira/browse/HIVE-5245 > Project: Hive > Issue Type: Bug > Components: HiveServer2 > Affects Versions: 0.11.0 > Reporter: jeff little > Labels: CTAS, hive > Original Estimate: 96h > Remaining Estimate: 96h > > hello everyone, recently i came across one hive problem as below: > hive (test)> create table test_09 as > > select a.* from test_01 a > > join test_02 b > > on (a.id=b.id); > Automatically selecting local only mode for query > Total MapReduce jobs = 2 > setting HADOOP_USER_NAME hadoop > 13/09/09 17:22:36 WARN conf.Configuration: > file:/tmp/hadoop/hive_2013-09-09_17-22-34_848_1629553341892012305/-local-10008/jobconf.xml:a > attempt to override final parameter: mapred.system.dir; Ignoring. > 13/09/09 17:22:36 WARN conf.Configuration: > file:/tmp/hadoop/hive_2013-09-09_17-22-34_848_1629553341892012305/-local-10008/jobconf.xml:a > attempt to override final parameter: mapred.local.dir; Ignoring. > Execution log at: /tmp/hadoop/.log > 2013-09-09 05:22:36 Starting to launch local task to process map join; > maximum memory = 932118528 > 2013-09-09 05:22:37 Processing rows: 4 Hashtable size: 4 > Memory usage: 113068056 rate: 0.121 > 2013-09-09 05:22:37 Dump the hashtable into file: > file:/tmp/hadoop/hive_2013-09-09_17-22-34_848_1629553341892012305/-local-10005/HashTable-Stage-6/MapJoin-mapfile90--.hashtable > 2013-09-09 05:22:37 Upload 1 File to: > file:/tmp/hadoop/hive_2013-09-09_17-22-34_848_1629553341892012305/-local-10005/HashTable-Stage-6/MapJoin-mapfile90--.hashtable > File size: 788 > 2013-09-09 05:22:37 End of local task; Time Taken: 0.444 sec. > Execution completed successfully > Mapred Local Task Succeeded . Convert the Join into MapJoin > Mapred Local Task Succeeded . Convert the Join into MapJoin > Launching Job 1 out of 2 > Number of reduce tasks is set to 0 since there's no reduce operator > 13/09/09 17:22:38 WARN conf.Configuration: > file:/tmp/hadoop/hive_2013-09-09_17-22-34_848_1629553341892012305/-local-10009/jobconf.xml:a > attempt to override final parameter: mapred.system.dir; Ignoring. > 13/09/09 17:22:38 WARN conf.Configuration: > file:/tmp/hadoop/hive_2013-09-09_17-22-34_848_1629553341892012305/-local-10009/jobconf.xml:a > attempt to override final parameter: mapred.local.dir; Ignoring. > Execution log at: /tmp/hadoop/.log > Job running in-process (local Hadoop) > Hadoop job information for null: number of mappers: 0; number of reducers: 0 > 2013-09-09 17:22:41,807 null map = 0%, reduce = 0% > 2013-09-09 17:22:44,814 null map = 100%, reduce = 0% > Ended Job = job_local_0001 > Execution completed successfully > Mapred Local Task Succeeded . Convert the Join into MapJoin > Stage-7 is filtered out by condition resolver. > OK > Time taken: 13.138 seconds > hive (test)> select * from test_09; > FAILED: SemanticException [Error 10001]: Line 1:14 Table not found 'test_09' > hive (test)> > Problem: > I can't get the created table, namely this CTAS is nonavailable, and this > table is not created by this hql sentence at all.who can explain for > me.Thanks. -- This message was sent by Atlassian JIRA (v6.1#6144)