[ https://issues.apache.org/jira/browse/HIVE-4199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Samuel Yuan updated HIVE-4199: ------------------------------ Status: Patch Available (was: Open) > ORC writer doesn't handle non-UTF8 encoded Text properly > -------------------------------------------------------- > > Key: HIVE-4199 > URL: https://issues.apache.org/jira/browse/HIVE-4199 > Project: Hive > Issue Type: Bug > Components: Serializers/Deserializers > Reporter: Samuel Yuan > Assignee: Samuel Yuan > Priority: Minor > Attachments: HIVE-4199.HIVE-4199.HIVE-4199.D9501.1.patch, > HIVE-4199.HIVE-4199.HIVE-4199.D9501.2.patch, > HIVE-4199.HIVE-4199.HIVE-4199.D9501.3.patch > > > StringTreeWriter currently converts fields stored as Text objects into > Strings. This can lose information (see > http://en.wikipedia.org/wiki/Replacement_character#Replacement_character), > and is also unnecessary since the dictionary stores Text objects. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira