[
https://issues.apache.org/jira/browse/HIVE-7955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14202958#comment-14202958
]
Thomas Friedrich commented on HIVE-7955:
----------------------------------------
The test hook_context_cs fails while checking for expected value in
VerifyContentSummaryCacheHook:
TestSparkCliDriver.testCliDriver_hook_context_cs:120->runTest:146 Unexpected
exception junit.framework.AssertionFailedError: expected:<1> but was:<0>
at junit.framework.Assert.fail(Assert.java:57)
at junit.framework.Assert.failNotEquals(Assert.java:329)
at junit.framework.Assert.assertEquals(Assert.java:78)
at junit.framework.Assert.assertEquals(Assert.java:234)
at junit.framework.Assert.assertEquals(Assert.java:241)
at
org.apache.hadoop.hive.ql.hooks.VerifyContentSummaryCacheHook.run(VerifyContentSummaryCacheHook.java:34)
at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1512)
at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1216)
at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1043)
at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1033)
The problem is that the post-hook check in
org.apache.hadoop.hive.ql.hooks.VerifyContentSummaryCacheHook fails because
inputToCS.size is 0 instead of 1:
Assert.assertEquals(1, inputToCS.size());
I ran the same test with TestCliDriver where Hive will create MapRedTask
objects. For a MapRedTask, the hookContext.getInputPathToContentSummary() is
set in the setNumberOfReducers method, line 400:
inputSummary = Utilities.getInputSummary(driverContext.getCtx(),
work.getMapWork(), null);
(the path is added in Utilities.getInputSummary(Context, MapWork, PathFilter)
line: 2584).
Since we have a SparkTask, the array will be empty, hence the check for
length==1 fails.
It looks somewhat related to HIVE-8504 where the test case also configures a
hive.exec.post.hooks, but since it's a different post-hook class altogether, I
opened a separate JIRA HIVE-8790.
Maybe something needs to be added to the SparkTask.
> Investigate query failures (4)
> ------------------------------
>
> Key: HIVE-7955
> URL: https://issues.apache.org/jira/browse/HIVE-7955
> Project: Hive
> Issue Type: Sub-task
> Components: Spark
> Reporter: Brock Noland
> Assignee: Thomas Friedrich
>
> I ran all q-file tests and the following failed with an exception:
> http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/HIVE-SPARK-ALL-TESTS-Build/lastCompletedBuild/testReport/
> we don't necessary want to run all these tests as part of the spark tests,
> but we should understand why they failed with an exception. This JIRA is to
> look into these failures and document them with one of:
> * New JIRA
> * Covered under existing JIRA
> * More investigation required
> Tests:
> {noformat}
>
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_dynpart_sort_optimization
> 12 sec 2
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_schemeAuthority2
> 0.23 sec 2
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_load_dyn_part8
> 10 sec 2
>
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_auto_sortmerge_join_4
> 11 sec 2
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_orc_analyze
> 8 sec 2
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_tez_join_hash
> 0.98 sec 2
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_hook_context_cs
> 2.1 sec 2
>
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_insert_overwrite_local_directory_1
> 3.7 sec 2
>
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_archive_excludeHadoop20
> 27 sec 2
>
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_auto_sortmerge_join_9
> 8.2 sec 2
>
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_limit_partition_metadataonly
> 0.77 sec 2
>
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_bucket_num_reducers2
> 7 sec 2
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_groupby_bigdata
> 0.6 sec 2
>
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_bucketsortoptimize_insert_6
> 6.6 sec 2
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_smb_mapjoin_25
> 2.6 sec 2
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_dbtxnmgr_query3
> 0.48 sec 2
>
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_auto_sortmerge_join_16
> 8.5 sec 2
>
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_empty_dir_in_table
> 2.6 sec 2
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_input33 1.3 sec
> 2
>
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_authorization_admin_almighty1
> 2.8 sec 2
>
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_udf_context_aware
> 0.23 sec 2
>
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_authorization_view_sqlstd
> 4.1 sec 2
> org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver_smb_mapjoin_12
> {noformat}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)