[ https://issues.apache.org/jira/browse/HIVE-21015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16712957#comment-16712957 ]
Peter Vary commented on HIVE-21015:
-----------------------------------

+1

> HCatLoader can't provide statistics for tables not in default DB
> ----------------------------------------------------------------
>
>                 Key: HIVE-21015
>                 URL: https://issues.apache.org/jira/browse/HIVE-21015
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Adam Szita
>            Assignee: Adam Szita
>            Priority: Major
>         Attachments: HIVE-21015.0.patch, HIVE-21015.1.patch, HIVE-21015.2.patch, HIVE-21015.3.patch
>
>
> This is due to a former change (HIVE-20330) that does not take the database into
> consideration when retrieving the proper InputJobInfo for the loader.
> Found during testing:
> {code:java}
> 07:52:56 2018-12-05 07:52:16,599 [main] WARN org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.InputSizeReducerEstimator - Couldn't get statistics from LoadFunc: org.apache.hive.hcatalog.pig.HCatLoader@492fa72a
> 07:52:56 java.io.IOException: java.io.IOException: Could not calculate input size for location (table) tpcds_3000_decimal_parquet.date_dim
> 07:52:56 at org.apache.hive.hcatalog.pig.HCatLoader.getStatistics(HCatLoader.java:281)
> 07:52:56 at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.InputSizeReducerEstimator.getInputSizeFromLoader(InputSizeReducerEstimator.java:171)
> 07:52:56 at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.InputSizeReducerEstimator.getTotalInputFileSize(InputSizeReducerEstimator.java:118)
> 07:52:56 at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.InputSizeReducerEstimator.getTotalInputFileSize(InputSizeReducerEstimator.java:97)
> 07:52:56 at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.InputSizeReducerEstimator.estimateNumberOfReducers(InputSizeReducerEstimator.java:80)
> 07:52:56 at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler.estimateNumberOfReducers(JobControlCompiler.java:1148)
> 07:52:56 at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler.calculateRuntimeReducers(JobControlCompiler.java:1115)
> 07:52:56 at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler.adjustNumReducers(JobControlCompiler.java:1063)
> 07:52:56 at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler.getJob(JobControlCompiler.java:564)
> 07:52:56 at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler.compile(JobControlCompiler.java:333)
> 07:52:56 at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher.launchPig(MapReduceLauncher.java:221)
> 07:52:56 at org.apache.pig.backend.hadoop.executionengine.HExecutionEngine.launchPig(HExecutionEngine.java:293)
> 07:52:56 at org.apache.pig.PigServer.launchPlan(PigServer.java:1475)
> 07:52:56 at org.apache.pig.PigServer.executeCompiledLogicalPlan(PigServer.java:1460)
> 07:52:56 at org.apache.pig.PigServer.storeEx(PigServer.java:1119)
> 07:52:56 at org.apache.pig.PigServer.store(PigServer.java:1082)
> 07:52:56 at org.apache.pig.PigServer.openIterator(PigServer.java:995)
> 07:52:56 at org.apache.pig.tools.grunt.GruntParser.processDump(GruntParser.java:782)
> 07:52:56 at org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:383)
> 07:52:56 at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:230)
> 07:52:56 at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:205)
> 07:52:56 at org.apache.pig.tools.grunt.Grunt.exec(Grunt.java:81)
> 07:52:56 at org.apache.pig.Main.run(Main.java:630)
> 07:52:56 at org.apache.pig.Main.main(Main.java:175)
> 07:52:56 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 07:52:56 at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> 07:52:56 at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> 07:52:56 at java.lang.reflect.Method.invoke(Method.java:498)
> 07:52:56 at org.apache.hadoop.util.RunJar.run(RunJar.java:313)
> 07:52:56 at org.apache.hadoop.util.RunJar.main(RunJar.java:227)
> 07:52:56 Caused by: java.io.IOException: Could not calculate input size for location (table) tpcds_3000_decimal_parquet.date_dim
> 07:52:56 at org.apache.hive.hcatalog.pig.HCatLoader.getStatistics(HCatLoader.java:276)
> 07:52:56 ... 29 more
> {code}

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
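
For readers unfamiliar with this class of bug, the following is a minimal sketch of how a lookup that ignores the database can miss a table such as tpcds_3000_decimal_parquet.date_dim. It is not the HCatLoader code or the attached patch: the classes `TableInput` and `InputLookupSketch`, both method names, the fallback to a database named "default", and the size value 1024 are all hypothetical and chosen only for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Optional;

// Hypothetical stand-in for per-table input metadata; NOT the real InputJobInfo class.
final class TableInput {
    final String database;
    final String table;
    final long sizeBytes;

    TableInput(String database, String table, long sizeBytes) {
        this.database = database;
        this.table = table;
        this.sizeBytes = sizeBytes;
    }
}

final class InputLookupSketch {

    // Compares the whole location string against the table name only, so a
    // qualified location like "db.table" never matches an entry named "table".
    static Optional<TableInput> findByTableOnly(List<TableInput> inputs, String location) {
        return inputs.stream()
                .filter(i -> i.table.equals(location))
                .findFirst();
    }

    // Splits "db.table" and compares both parts. An unqualified location is
    // assumed (for this sketch) to live in a database called "default".
    static Optional<TableInput> findByQualifiedName(List<TableInput> inputs, String location) {
        int dot = location.indexOf('.');
        String db = dot < 0 ? "default" : location.substring(0, dot);
        String table = dot < 0 ? location : location.substring(dot + 1);
        return inputs.stream()
                .filter(i -> i.database.equals(db) && i.table.equals(table))
                .findFirst();
    }

    public static void main(String[] args) {
        List<TableInput> inputs = new ArrayList<>();
        // 1024 is an arbitrary illustrative size, not a measured value.
        inputs.add(new TableInput("tpcds_3000_decimal_parquet", "date_dim", 1024L));

        String location = "tpcds_3000_decimal_parquet.date_dim";
        System.out.println("table-only match found: "
                + findByTableOnly(inputs, location).isPresent());
        System.out.println("qualified match found:  "
                + findByQualifiedName(inputs, location).isPresent());
    }
}
```

In this sketch the table-only lookup returns nothing for the qualified location, which is the kind of miss that would leave a caller unable to calculate an input size, while the database-aware lookup finds the entry.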