[ https://issues.apache.org/jira/browse/HIVE-14987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16578503#comment-16578503 ]
Kristopher Kane commented on HIVE-14987: ---------------------------------------- Can see this problem on Hive 1.2.1 Was it fixed in later versions? > CombineHiveInputFormat with Tez fails to initiate vertex if table is empty > -------------------------------------------------------------------------- > > Key: HIVE-14987 > URL: https://issues.apache.org/jira/browse/HIVE-14987 > Project: Hive > Issue Type: Bug > Reporter: Yi Zhang > Priority: Major > > Sometimes user have developed custom inputformat that extends from > CombineHiveInputFormat due to difficulty of extending from HiveInputFormat > directly, for example to filter out old data files. > in this use case, vertex fails to get initialized: > SELECT city.cid > FROM > (select city_id as cid, > row_number() over(partition by timezone order by population) rnum > from cities) city > JOIN > (select datestr, id from yizhang.emptyparts where datestr >= > date_sub(current_date(),30)) emp > on city.cid = emp.id > ; > -------------------------------------------------------------------------------- > VERTICES STATUS TOTAL COMPLETED RUNNING PENDING FAILED > KILLED > -------------------------------------------------------------------------------- > Map 1 KILLED -1 0 0 -1 0 > 0 > Map 3 FAILED -1 0 0 -1 0 > 0 > Reducer 2 KILLED 1 0 0 1 0 > 0 > -------------------------------------------------------------------------------- > VERTICES: 00/03 [>>--------------------------] 0% ELAPSED TIME: 0.34 s > > -------------------------------------------------------------------------------- > Status: Failed > Vertex failed, vertexName=Map 3, vertexId=vertex_1476217616538_398108_1_01, > diagnostics=[Vertex vertex_1476217616538_398108_1_01 [Map 3] killed/failed > due to:ROOT_INPUT_INIT_FAILURE, Vertex Input: emp initializer failed, > vertex=vertex_1476217616538_398108_1_01 [Map 3], > java.lang.IllegalArgumentException > at > java.util.concurrent.ThreadPoolExecutor.<init>(ThreadPoolExecutor.java:1307) > at > java.util.concurrent.ThreadPoolExecutor.<init>(ThreadPoolExecutor.java:1195) > at java.util.concurrent.Executors.newFixedThreadPool(Executors.java:89) > at > org.apache.hadoop.hive.ql.io.CombineHiveInputFormat.getSplits(CombineHiveInputFormat.java:519) > at > org.apache.tez.mapreduce.hadoop.MRInputHelpers.generateOldSplits(MRInputHelpers.java:447) > at > org.apache.tez.mapreduce.hadoop.MRInputHelpers.generateInputSplitsToMem(MRInputHelpers.java:299) > at > org.apache.tez.mapreduce.common.MRInputAMSplitGenerator.initialize(MRInputAMSplitGenerator.java:121) > at > org.apache.tez.dag.app.dag.RootInputInitializerManager$InputInitializerCallable$1.run(RootInputInitializerManager.java:264) > at > org.apache.tez.dag.app.dag.RootInputInitializerManager$InputInitializerCallable$1.run(RootInputInitializerManager.java:258) > at java.security.AccessController.doPrivileged(Native Method) > at javax.security.auth.Subject.doAs(Subject.java:422) > at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1671) > at > org.apache.tez.dag.app.dag.RootInputInitializerManager$InputInitializerCallable.call(RootInputInitializerManager.java:258) > at > org.apache.tez.dag.app.dag.RootInputInitializerManager$InputInitializerCallable.call(RootInputInitializerManager.java:245) > at java.util.concurrent.FutureTask.run(FutureTask.java:266) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745) > ] -- This message was sent by Atlassian JIRA (v7.6.3#76005)