Brock Noland created HIVE-5000: ---------------------------------- Summary: hive.optimize.skewjoin can cause long running queries to fail Key: HIVE-5000 URL: https://issues.apache.org/jira/browse/HIVE-5000 Project: Hive Issue Type: Bug Reporter: Brock Noland Priority: Minor
{noformat} MapReduce Total cumulative CPU time: 5 days 19 hours 7 minutes 8 seconds 540 msec Ended Job = job_201301311513_15328 java.io.FileNotFoundException: File hdfs://xxxx:8020/tmp/hive-scripts/hive_2013-02-06_10-23-17_026_1520760778337129611/ -mr-10002/hive_skew_join_bigkeys_0 does not exist. at org.apache.hadoop.hdfs.DistributedFileSystem.listStatus(DistributedFileSystem.java:406) at org.apache.hadoop.hive.ql.plan.ConditionalResolverSkewJoin.getTasks(ConditionalResolverSkewJoin.java:96) at org.apache.hadoop.hive.ql.exec.ConditionalTask.execute(ConditionalTask.java:81) at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:153) at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:57) at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1331) at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1117) at org.apache.hadoop.hive.ql.Driver.run(Driver.java:950) at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:258) at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:215) at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:406) at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:341) at org.apache.hadoop.hive.cli.CliDriver.processReader(CliDriver.java:439) at org.apache.hadoop.hive.cli.CliDriver.processFile(CliDriver.java:449) at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:700) at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:607) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:597) at org.apache.hadoop.util.RunJar.main(RunJar.java:208) Ended Job = -1079843427, job is filtered out (removed at runtime). 8390065 Rows loaded to hdfs://xxxx/tmp/hive-scripts/hive_2013-02-06_10-23-17_026_1520760778337129611/-ext-10000 MapReduce Jobs Launched: Job 0: Map: 970 Reduce: 260 Cumulative CPU: 500828.54 sec HDFS Read: 0 HDFS Write: 0 SUCCESS Total MapReduce CPU Time Spent: 5 days 19 hours 7 minutes 8 seconds 540 msec OK {noformat} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira