Should hive-on-tez-conf.txt be added to the wiki, or is it not soup yet? -- Lefty
On Mon, Dec 16, 2013 at 10:25 AM, Cheolsoo Park <piaozhe...@gmail.com>wrote: > Closing the loop. We identified the issue with help from the Tez team. It > was mis-configured mapreduce.reduce.cpu.vcores that caused problems. > > If anyone who tries Hive on Tez with EMR Hadoop and sees that reducers are > stuck, > this<http://mail-archives.apache.org/mod_mbox/incubator-tez-user/201312.mbox/%3Cc3f953c465a3b2f4b02d62348ce67161%40mail.gmail.com%3E> > might > be helpful. > > Thanks, > Cheolsoo > > On Fri, Dec 13, 2013 at 4:51 PM, Gunther Hagleitner < > ghagleit...@hortonworks.com> wrote: > >> dev on bcc >> >> Zhenxiao, >> >> Cool you got it set up. >> >> The query runs a full order by before the limit - are you sure it's not >> just still running? Hive on Tez prints "total tasks/completed tasks", so no >> update may mean none of the reduce tasks have finished yet. >> >> If not, it'd be great to see the yarn logs (yarn logs -applicationId) and >> get more info about the table you're using (size, file format, etc). If the >> logs are really big you might want to consider opening/attaching them to a >> jira (issues.apache.org) (or send them directly to me). >> >> There are a bunch of settings that might be of interest to you (in >> general not just for this query) - I've attached a text doc with some >> details. >> >> Thanks, >> Gunther. >> >> >> >> >> On Fri, Dec 13, 2013 at 1:12 PM, Zhenxiao Luo <z...@netflix.com> wrote: >> >>> Hi, >>> >>> Excuse me. May I ask a question about running Hive on Tez? >>> >>> I've installed Hive on Tez, and running a simple query from hiveCli, >>> >>> hive> set hive.optimize.tez=true; >>> hive> select * from table order by title_id limit 5; >>> >>> While, each time, I could see from the TezJobMonitor that, all the map >>> jobs are done, but the reducer never get started, and the job is >>> running forever there. >>> >>> I tried a number of times, and each time the same failure(job running >>> hangs) happens again and again. >>> Does anyone successfully running queries using Hive on Tez? Are there >>> any tips or suggestions? >>> >>> Here is my job log: >>> >>> 13/12/13 20:57:31 INFO client.TezSession: Submitting dag to >>> TezSession, sessionName=HIVE-365b35bc-2461-4e2f-83f9-8da1fa356a86, >>> applicationId=application_1386896881353_0027 >>> 13/12/13 20:57:33 INFO client.TezSession: Submitted dag to TezSession, >>> sessionName=HIVE-365b35bc-2461-4e2f-83f9-8da1fa356a86, >>> applicationId=application_1386896881353_0027, >>> dagId=dag_1386896881353_0027_1 >>> 13/12/13 20:57:33 INFO client.RMProxy: Connecting to ResourceManager >>> at /10.183.195.180:9022 >>> 13/12/13 20:57:33 INFO log.PerfLogger: </PERFLOG method=TezSubmitDag >>> start=1386968251250 end=1386968253338 duration=2088 >>> from=org.apache.hadoop.hive.ql.exec.tez.TezTask> >>> >>> >>> 13/12/13 20:57:33 INFO tez.TezJobMonitor: >>> >>> 13/12/13 20:57:33 INFO log.PerfLogger: <PERFLOG method=TezRunDag >>> from=org.apache.hadoop.hive.ql.exec.tez.TezJobMonitor> >>> 13/12/13 20:57:33 INFO log.PerfLogger: <PERFLOG >>> method=TezSubmitToRunningDag >>> from=org.apache.hadoop.hive.ql.exec.tez.TezJobMonitor> >>> 13/12/13 20:57:33 INFO log.PerfLogger: </PERFLOG >>> method=TezSubmitToRunningDag start=1386968253341 end=1386968253402 >>> duration=61 from=org.apache.hadoop.hive.ql.exec.tez.TezJobMonitor> >>> Status: Running (application id: application_1386896881353_0027) >>> >>> 13/12/13 20:57:33 INFO tez.TezJobMonitor: Status: Running (application >>> id: application_1386896881353_0027) >>> >>> 13/12/13 20:57:33 INFO log.PerfLogger: <PERFLOG >>> method=TezRunVertex.Reducer 2 >>> from=org.apache.hadoop.hive.ql.exec.tez.TezJobMonitor> >>> 13/12/13 20:57:33 INFO log.PerfLogger: <PERFLOG >>> method=TezRunVertex.Map 1 >>> from=org.apache.hadoop.hive.ql.exec.tez.TezJobMonitor> >>> Map 1: -/- Reducer 2: -/- >>> 13/12/13 20:57:33 INFO tez.TezJobMonitor: Map 1: -/- Reducer 2: -/- >>> Map 1: -/- Reducer 2: 0/1 >>> 13/12/13 20:57:33 INFO tez.TezJobMonitor: Map 1: -/- Reducer 2: 0/1 >>> Map 1: 0/16 Reducer 2: 0/1 >>> 13/12/13 20:57:34 INFO tez.TezJobMonitor: Map 1: 0/16 Reducer 2: 0/1 >>> Map 1: 0/16 Reducer 2: 0/1 >>> 13/12/13 20:57:37 INFO tez.TezJobMonitor: Map 1: 0/16 Reducer 2: 0/1 >>> Map 1: 0/16 Reducer 2: 0/1 >>> 13/12/13 20:57:40 INFO tez.TezJobMonitor: Map 1: 0/16 Reducer 2: 0/1 >>> Map 1: 0/16 Reducer 2: 0/1 >>> 13/12/13 20:57:43 INFO tez.TezJobMonitor: Map 1: 0/16 Reducer 2: 0/1 >>> Map 1: 0/16 Reducer 2: 0/1 >>> 13/12/13 20:57:46 INFO tez.TezJobMonitor: Map 1: 0/16 Reducer 2: 0/1 >>> Map 1: 0/16 Reducer 2: 0/1 >>> 13/12/13 20:57:49 INFO tez.TezJobMonitor: Map 1: 0/16 Reducer 2: 0/1 >>> Map 1: 0/16 Reducer 2: 0/1 >>> 13/12/13 20:57:52 INFO tez.TezJobMonitor: Map 1: 0/16 Reducer 2: 0/1 >>> Map 1: 0/16 Reducer 2: 0/1 >>> 13/12/13 20:57:55 INFO tez.TezJobMonitor: Map 1: 0/16 Reducer 2: 0/1 >>> Map 1: 1/16 Reducer 2: 0/1 >>> 13/12/13 20:57:56 INFO tez.TezJobMonitor: Map 1: 1/16 Reducer 2: 0/1 >>> Map 1: 2/16 Reducer 2: 0/1 >>> 13/12/13 20:57:58 INFO tez.TezJobMonitor: Map 1: 2/16 Reducer 2: 0/1 >>> Map 1: 3/16 Reducer 2: 0/1 >>> 13/12/13 20:57:58 INFO tez.TezJobMonitor: Map 1: 3/16 Reducer 2: 0/1 >>> Map 1: 5/16 Reducer 2: 0/1 >>> 13/12/13 20:57:59 INFO tez.TezJobMonitor: Map 1: 5/16 Reducer 2: 0/1 >>> Map 1: 8/16 Reducer 2: 0/1 >>> 13/12/13 20:57:59 INFO tez.TezJobMonitor: Map 1: 8/16 Reducer 2: 0/1 >>> Map 1: 12/16 Reducer 2: 0/1 >>> 13/12/13 20:57:59 INFO tez.TezJobMonitor: Map 1: 12/16 Reducer 2: 0/1 >>> Map 1: 15/16 Reducer 2: 0/1 >>> 13/12/13 20:58:00 INFO tez.TezJobMonitor: Map 1: 15/16 Reducer 2: 0/1 >>> 13/12/13 20:58:00 INFO log.PerfLogger: </PERFLOG >>> method=TezRunVertex.Map 1 start=1386968253402 end=1386968280223 >>> duration=26821 from=org.apache.hadoop.hive.ql.exec.tez.TezJobMonitor> >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:58:00 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:58:03 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:58:06 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:58:09 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:58:12 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:58:15 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:58:18 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:58:21 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:58:24 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:58:27 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:58:30 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:58:33 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:58:36 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:58:39 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:58:42 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:58:45 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:58:48 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:58:51 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:58:54 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:58:57 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:59:00 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:59:03 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:59:06 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:59:09 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:59:12 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:59:15 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:59:18 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:59:21 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:59:24 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:59:27 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:59:30 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:59:33 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:59:36 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:59:39 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:59:42 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:59:45 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:59:48 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:59:52 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:59:55 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 20:59:58 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 21:00:01 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 21:00:04 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 21:00:07 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 21:00:10 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> Map 1: 16/16 Reducer 2: 0/1 >>> 13/12/13 21:00:13 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1 >>> >>> Thanks, >>> Zhenxiao >>> >> >> >> CONFIDENTIALITY NOTICE >> NOTICE: This message is intended for the use of the individual or entity >> to which it is addressed and may contain information that is confidential, >> privileged and exempt from disclosure under applicable law. If the reader >> of this message is not the intended recipient, you are hereby notified that >> any printing, copying, dissemination, distribution, disclosure or >> forwarding of this communication is strictly prohibited. If you have >> received this communication in error, please contact the sender immediately >> and delete it from your system. Thank You. > > >