[ https://issues.apache.org/jira/browse/HIVE-2137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Kousuke Saruta updated HIVE-2137: --------------------------------- Description: JDBC driver for HiveServer1 decodes string by client side default encoding, which depends on operating system unless we don't specify another encoding. It ignore server side encoding. For example, when server side operating system and encoding are Linux (utf-8) and client side operating system and encoding are Windows (shift-jis : it's japanese charset, makes character corruption happens in the client. In current implementation of Hive, UTF-8 appears to be expected in server side so client side should encode/decode string as UTF-8. was: JDBC driver decodes string by client side default encoding, which depends on operating system unless we don't specify another encoding. It ignore server side encoding. For example, when server side operating system and encoding are Linux (utf-8) and client side operating system and encoding are Windows (shift-jis : it's japanese charset, makes character corruption happens in the client. In current implementation of Hive, UTF-8 appears to be expected in server side so client side should encode/decode string as UTF-8. > JDBC driver doesn't encode string properly. > ------------------------------------------- > > Key: HIVE-2137 > URL: https://issues.apache.org/jira/browse/HIVE-2137 > Project: Hive > Issue Type: Bug > Components: JDBC > Affects Versions: 0.9.0 > Reporter: Jin Adachi > Fix For: 0.12.0 > > Attachments: HIVE-2137.patch > > > JDBC driver for HiveServer1 decodes string by client side default encoding, > which depends on operating system unless we don't specify another encoding. > It ignore server side encoding. > For example, > when server side operating system and encoding are Linux (utf-8) and client > side operating system and encoding are Windows (shift-jis : it's japanese > charset, makes character corruption happens in the client. > In current implementation of Hive, UTF-8 appears to be expected in server > side so client side should encode/decode string as UTF-8. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira