[ https://issues.apache.org/jira/browse/HIVE-21291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823881#comment-16823881 ]
Hive QA commented on HIVE-21291:
--------------------------------

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12966700/HIVE-21291.6.patch

{color:green}SUCCESS:{color} +1 due to 4 test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 20 failed/errored test(s), 15825 tests executed
*Failed tests:*
{noformat}
TestCachedStore - did not produce a TEST-*.xml file (likely timed out) (batchId=231)
TestCatalogCaching - did not produce a TEST-*.xml file (likely timed out) (batchId=231)
TestDataSourceProviderFactory - did not produce a TEST-*.xml file (likely timed out) (batchId=231)
TestDeadline - did not produce a TEST-*.xml file (likely timed out) (batchId=231)
TestHiveMetaStoreGetMetaConf - did not produce a TEST-*.xml file (likely timed out) (batchId=231)
TestHiveMetaStoreTimeout - did not produce a TEST-*.xml file (likely timed out) (batchId=231)
TestHiveMetaStoreTxns - did not produce a TEST-*.xml file (likely timed out) (batchId=231)
TestMarkPartition - did not produce a TEST-*.xml file (likely timed out) (batchId=231)
TestMarkPartitionRemote - did not produce a TEST-*.xml file (likely timed out) (batchId=231)
TestMetaStoreEventListenerOnlyOnCommit - did not produce a TEST-*.xml file (likely timed out) (batchId=231)
TestMetaStoreEventListenerWithOldConf - did not produce a TEST-*.xml file (likely timed out) (batchId=231)
TestMetaStoreInitListener - did not produce a TEST-*.xml file (likely timed out) (batchId=231)
TestMetaStoreListenersError - did not produce a TEST-*.xml file (likely timed out) (batchId=231)
TestMetaStoreSchemaInfo - did not produce a TEST-*.xml file (likely timed out) (batchId=231)
TestObjectStore - did not produce a TEST-*.xml file (likely timed out) (batchId=231)
TestObjectStoreSchemaMethods - did not produce a TEST-*.xml file (likely timed out) (batchId=231)
TestRemoteHiveMetaStoreZKBindHost - did not produce a TEST-*.xml file (likely timed out) (batchId=231)
TestTableIterable - did not produce a TEST-*.xml file (likely timed out) (batchId=231)
org.apache.hadoop.hive.ql.parse.TestReplAcidTablesWithJsonMessage.testBootstrapAcidTablesDuringIncrementalWithConcurrentWrites (batchId=256)
org.apache.hive.jdbc.TestMultiSessionsHS2WithLocalClusterSpark.org.apache.hive.jdbc.TestMultiSessionsHS2WithLocalClusterSpark (batchId=273)
{noformat}

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/17010/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/17010/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-17010/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.YetusPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 20 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12966700 - PreCommit-HIVE-Build

> Restore historical way of handling timestamps in Avro while keeping the new
> semantics at the same time
> ------------------------------------------------------------------------------------------------------
>
> Key: HIVE-21291
> URL: https://issues.apache.org/jira/browse/HIVE-21291
> Project: Hive
> Issue Type: Sub-task
> Reporter: Zoltan Ivanfi
> Assignee: Karen Coppage
> Priority: Major
> Attachments: HIVE-21291.1.patch, HIVE-21291.2.patch,
> HIVE-21291.3.patch, HIVE-21291.4.patch, HIVE-21291.4.patch,
> HIVE-21291.5.patch, HIVE-21291.6.patch
>
>
> This sub-task is for implementing the Avro-specific parts of the following
> plan:
> h1. Problem
> Historically, the semantics of the TIMESTAMP type in Hive depended on the
> file format. Timestamps in Avro, Parquet and RCFiles with a binary SerDe had
> _Instant_ semantics, while timestamps in ORC, textfiles and RCFiles with a
> text SerDe had _LocalDateTime_ semantics.
> The Hive community wanted to get rid of this inconsistency and have
> _LocalDateTime_ semantics in Avro, Parquet and RCFiles with a binary SerDe as
> well. *Hive 3.1 turned off normalization to UTC* to achieve this. While this
> leads to the desired new semantics, it also leads to incorrect results when
> new Hive versions read timestamps written by old Hive versions or when old
> Hive versions or any other component not aware of this change (including
> legacy Impala and Spark versions) read timestamps written by new Hive
> versions.
> h1. Solution
> To work around this issue, Hive *should restore the practice of normalizing
> to UTC* when writing timestamps to Avro, Parquet and RCFiles with a binary
> SerDe. In itself, this would restore the historical _Instant_ semantics,
> which is undesirable. In order to achieve the desired _LocalDateTime_
> semantics in spite of normalizing to UTC, newer Hive versions should record
> the session-local local time zone in the file metadata fields serving
> arbitrary key-value storage purposes.
> When reading back files with this time zone metadata, newer Hive versions (or
> any other new component aware of this extra metadata) can achieve
> _LocalDateTime_ semantics by *converting from UTC to the saved time zone
> (instead of to the local time zone)*. Legacy components that are unaware of
> the new metadata can read the files without any problem and the timestamps
> will show the historical Instant behaviour to them.

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
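
The write/read scheme proposed in the Solution section of the quoted issue can be sketched in a few lines of plain Java. This is only an illustration under stated assumptions, not the code in the attached patches: the class name, the method names and the metadata key "writer.time.zone" are hypothetical, and a HashMap stands in for the file's key-value metadata. A legacy reader is modelled by passing metadata that lacks the key.

```java
import java.time.Instant;
import java.time.LocalDateTime;
import java.time.ZoneId;
import java.util.HashMap;
import java.util.Map;

public class TimestampZoneSketch {
    // Hypothetical metadata key; the issue text does not name the real one.
    public static final String ZONE_KEY = "writer.time.zone";

    /** Writer: normalize the session-local value to UTC and record the session zone. */
    public static long write(LocalDateTime local, ZoneId sessionZone,
                             Map<String, String> fileMetadata) {
        fileMetadata.put(ZONE_KEY, sessionZone.getId());
        // Interpret the wall-clock value in the writer's zone, store UTC epoch millis.
        return local.atZone(sessionZone).toInstant().toEpochMilli();
    }

    /**
     * Reader: convert from UTC to the saved zone when it is present
     * (LocalDateTime semantics); otherwise fall back to the reader's own
     * zone, which is what a legacy reader does (Instant semantics).
     */
    public static LocalDateTime read(long utcMillis, Map<String, String> fileMetadata,
                                     ZoneId readerZone) {
        String saved = fileMetadata.get(ZONE_KEY);
        ZoneId zone = (saved != null) ? ZoneId.of(saved) : readerZone;
        return LocalDateTime.ofInstant(Instant.ofEpochMilli(utcMillis), zone);
    }

    public static void main(String[] args) {
        Map<String, String> metadata = new HashMap<>();
        LocalDateTime written = LocalDateTime.of(2019, 4, 23, 12, 0);
        long stored = write(written, ZoneId.of("Europe/Budapest"), metadata);

        ZoneId readerZone = ZoneId.of("America/Los_Angeles");
        // Reader aware of the metadata: gets back the value the writer saw.
        System.out.println(read(stored, metadata, readerZone));
        // Legacy reader (no metadata): sees the same instant in its own zone.
        System.out.println(read(stored, new HashMap<>(), readerZone));
    }
}
```

In this sketch a value written as 12:00 in Europe/Budapest is read back as 12:00 by a metadata-aware reader in any zone, while a reader without the metadata in America/Los_Angeles sees 03:00 on the same day. Wall-clock values that fall into a daylight-saving gap or overlap in the writer's zone would not round-trip exactly with this simple conversion.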