[ https://issues.apache.org/jira/browse/HIVE-7393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Navis resolved HIVE-7393.
-------------------------
    Resolution: Duplicate
    Fix Version/s: 0.14.0

Already fixed by HIVE-7011

> Tez jobs sometimes fail with NPE processing input splits
> --------------------------------------------------------
>
>                 Key: HIVE-7393
>                 URL: https://issues.apache.org/jira/browse/HIVE-7393
>             Project: Hive
>          Issue Type: Bug
>          Components: Tez
>    Affects Versions: 0.13.0
>            Reporter: Steven Yu
>             Fix For: 0.14.0
>
>         Attachments: syslog_dag_1405114778353_0004_1.txt
>
>
> Input files are either ORC or RC format. Only occurs on occasion - if the query is repeated it is likely to complete successfully.
> {noformat}
> 2014-07-11 15:31:45,367 INFO [InputInitializer [Map 3] #0] org.apache.hadoop.mapred.split.TezMapredSplitsGrouper: Grouping splits in Tez
> 2014-07-11 15:31:45,367 INFO [InputInitializer [Map 3] #0] org.apache.hadoop.mapred.split.TezMapredSplitsGrouper: Desired splits: 408 too large. Desired splitLength: 614866 Min splitLength: 16777216 New desired splits: 15 Total length: 250865685 Original splits: 13
> 2014-07-11 15:31:45,367 INFO [InputInitializer [Map 3] #0] org.apache.hadoop.mapred.split.TezMapredSplitsGrouper: Using original number of splits: 13 desired splits: 15
> 2014-07-11 15:31:45,381 INFO [AsyncDispatcher event handler] org.apache.tez.dag.history.HistoryEventHandler: [HISTORY][DAG:dag_1405114778353_0004_1][Event:VERTEX_INITIALIZED]: vertexName=Reducer 4, vertexId=vertex_1405114778353_0004_1_09, initRequestedTime=1405117905313, initedTime=1405117905381, numTasks=999, processorName=org.apache.hadoop.hive.ql.exec.tez.ReduceTezProcessor, additionalInputsCount=0
> 2014-07-11 15:31:45,381 INFO [AsyncDispatcher event handler] org.apache.tez.dag.app.dag.impl.VertexImpl: vertex_1405114778353_0004_1_09 [Reducer 4] transitioned from NEW to INITED due to event V_INIT
> 2014-07-11 15:31:45,383 ERROR [AsyncDispatcher event handler] org.apache.tez.dag.app.dag.impl.VertexImpl: Vertex Input: csb initializer failed
> java.lang.NullPointerException
>     at org.apache.hadoop.hive.ql.io.HiveInputFormat.addSplitsForGroup(HiveInputFormat.java:275)
>     at org.apache.hadoop.hive.ql.io.HiveInputFormat.getSplits(HiveInputFormat.java:372)
>     at org.apache.hadoop.mapred.split.TezGroupedSplitsInputFormat.getSplits(TezGroupedSplitsInputFormat.java:68)
>     at org.apache.tez.mapreduce.hadoop.MRHelpers.generateOldSplits(MRHelpers.java:263)
>     at org.apache.tez.mapreduce.common.MRInputAMSplitGenerator.initialize(MRInputAMSplitGenerator.java:139)
>     at org.apache.tez.dag.app.dag.RootInputInitializerRunner$InputInitializerCallable$1.run(RootInputInitializerRunner.java:154)
>     at org.apache.tez.dag.app.dag.RootInputInitializerRunner$InputInitializerCallable$1.run(RootInputInitializerRunner.java:146)
>     at java.security.AccessController.doPrivileged(Native Method)
>     at javax.security.auth.Subject.doAs(Subject.java:415)
>     at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1557)
>     at org.apache.tez.dag.app.dag.RootInputInitializerRunner$InputInitializerCallable.call(RootInputInitializerRunner.java:146)
>     at org.apache.tez.dag.app.dag.RootInputInitializerRunner$InputInitializerCallable.call(RootInputInitializerRunner.java:114)
>     at java.util.concurrent.FutureTask.run(FutureTask.java:262)
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>     at java.lang.Thread.run(Thread.java:744)
> {noformat}

--
This message was sent by Atlassian JIRA
(v6.2#6252)