I now have an implementation of the reader ready in the [csv] source code:

https://svn.apache.org/repos/asf/commons/sandbox/csv/trunk/src/main/java/org/apache/commons/csv/UnicodeUnescapeReader.java

I think I'll also handle other escape sequences such as \n or \t.

Emmanuel Bourg

On 12/11/2011 00:27, Emmanuel Bourg wrote:
Hi,

It seems that unescaping Unicode escape sequences (\u1234) in an input stream is a common need. [configuration] does it for PropertiesConfiguration, and [csv] can also optionally decode these sequences. In the other direction, there is also a need to escape Unicode characters that a given encoding does not support when writing (see CONFIGURATION-457).

I think these features could be implemented as a UnicodeUnescapeReader and a UnicodeEscapeWriter that might fit into [io]. For the reader, any Unicode escape sequence would be transformed into the corresponding Unicode character, or ignored if the sequence is not valid. For the writer, a target charset would be specified in the constructor, and any character not supported by this charset would be turned into \uxxxx.

What do you think?

Emmanuel Bourg
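
To make the proposal concrete, here is a minimal sketch of the two classes. It is not the code committed to [csv]. The names UnicodeEscapeSketch, UnescapeReader, EscapeWriter and the unescape/escape helpers are hypothetical. The sketch assumes that "ignored" means an invalid sequence is passed through unchanged, it does not treat a doubled backslash specially, and the writer tests each char on its own, so surrogate pairs are escaped as two sequences.

```java
import java.io.FilterWriter;
import java.io.IOException;
import java.io.PushbackReader;
import java.io.Reader;
import java.io.StringReader;
import java.io.StringWriter;
import java.io.UncheckedIOException;
import java.io.Writer;
import java.nio.charset.Charset;
import java.nio.charset.CharsetEncoder;
import java.nio.charset.StandardCharsets;

class UnicodeEscapeSketch {

    /** Hypothetical reader: decodes backslash-u sequences; invalid ones pass through unchanged (assumption). */
    static class UnescapeReader extends Reader {
        private final PushbackReader in;

        UnescapeReader(Reader reader) { this.in = new PushbackReader(reader, 5); }

        @Override public int read() throws IOException {
            int c = in.read();
            if (c != '\\') return c;
            char[] buf = new char[5];          // the 'u' marker plus four hex digits
            int n = 0;
            while (n < 5) {
                int r = in.read(buf, n, 5 - n);
                if (r < 0) break;              // end of stream inside a possible sequence
                n += r;
            }
            if (n == 5 && buf[0] == 'u') {
                int value = 0;
                boolean valid = true;
                for (int i = 1; i < 5 && valid; i++) {
                    int digit = Character.digit(buf[i], 16);
                    valid = digit >= 0;
                    value = value * 16 + digit;
                }
                if (valid) return value;
            }
            if (n > 0) in.unread(buf, 0, n);   // not a valid sequence: put the lookahead back
            return c;
        }

        @Override public int read(char[] cbuf, int off, int len) throws IOException {
            if (len == 0) return 0;
            int count = 0;
            for (int c; count < len && (c = read()) >= 0; ) cbuf[off + count++] = (char) c;
            return count == 0 ? -1 : count;
        }

        @Override public void close() throws IOException { in.close(); }
    }

    /** Hypothetical writer: escapes every char the target charset cannot encode. */
    static class EscapeWriter extends FilterWriter {
        private final CharsetEncoder encoder;

        EscapeWriter(Writer out, Charset target) { super(out); this.encoder = target.newEncoder(); }

        @Override public void write(int c) throws IOException {
            char ch = (char) c;
            if (encoder.canEncode(ch)) out.write(ch);
            else out.write(String.format("\\u%04x", (int) ch));
        }

        @Override public void write(char[] cbuf, int off, int len) throws IOException {
            for (int i = off; i < off + len; i++) write(cbuf[i]);
        }

        @Override public void write(String s, int off, int len) throws IOException {
            for (int i = off; i < off + len; i++) write(s.charAt(i));
        }
    }

    /** Convenience helper: runs a whole string through UnescapeReader. */
    static String unescape(String s) {
        StringBuilder sb = new StringBuilder();
        try (Reader r = new UnescapeReader(new StringReader(s))) {
            for (int c; (c = r.read()) >= 0; ) sb.append((char) c);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return sb.toString();
    }

    /** Convenience helper: runs a whole string through EscapeWriter. */
    static String escape(String s, Charset target) {
        StringWriter sw = new StringWriter();
        try (Writer w = new EscapeWriter(sw, target)) {
            w.write(s);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return sw.toString();
    }

    public static void main(String[] args) {
        System.out.println(unescape("caf\\u00e9"));
        System.out.println(escape("10 \u20ac", StandardCharsets.ISO_8859_1));
    }
}
```

Wrapping the source in a PushbackReader keeps the lookahead simple: when the five characters after a backslash do not form a valid sequence, they are pushed back and read again as ordinary input.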