[ https://issues.apache.org/jira/browse/HIVE-8746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14378406#comment-14378406 ]
Lefty Leverenz commented on HIVE-8746: -------------------------------------- Should this fix be documented in the wiki? If so, here are two places it could go: * [ORC Files -- Column Encodings -- Timestamp Columns | https://cwiki.apache.org/confluence/display/Hive/LanguageManual+ORC#LanguageManualORC-TimestampColumns] * [Hive Data Types -- Timestamps | https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Types#LanguageManualTypes-Timestamps] Or maybe the ORC doc should have a small section on bug fixes, for higher visibility. > ORC timestamp columns are sensitive to daylight savings time > ------------------------------------------------------------ > > Key: HIVE-8746 > URL: https://issues.apache.org/jira/browse/HIVE-8746 > Project: Hive > Issue Type: Bug > Affects Versions: 0.11.0, 0.12.0, 0.13.0, 0.14.0, 1.0.0, 1.2.0, 1.1.0 > Reporter: Owen O'Malley > Assignee: Prasanth Jayachandran > Labels: orcfile > Fix For: 1.2.0 > > Attachments: HIVE-8746.1.patch, HIVE-8746.2.patch, HIVE-8746.3.patch, > HIVE-8746.4.patch > > > Hive uses Java's Timestamp class to manipulate timestamp columns. > Unfortunately the textual parsing in Timestamp is done in local time and the > internal storage is in UTC. > ORC mostly side steps this issue by storing the difference between the time > and a base time also in local and storing that difference in the file. > Reading the file between timezones will mostly work correctly "2014-01-01 > 12:34:56" will read correctly in every timezone. > However, when moving between timezones with different daylight saving it > creates trouble. In particular, moving from a computer in PST to UTC will > read "2014-06-06 12:34:56" as "2014-06-06 11:34:56". -- This message was sent by Atlassian JIRA (v6.3.4#6332)