Reader

Emmanuel Bourg Fri, 11 Nov 2011 15:28:12 -0800

Hi,

It seems that unescaping Unicode escape sequences (\u1234) in an input stream is a common need. [configuration] does it for PropertiesConfiguration, and [csv] can also decode these sequences optionally.

In the other direction, there is also a need to escape Unicode characters not supported by a given encoding when writing (see CONFIGURATION-457).

I think these features could be implemented as a UnicodeUnescapeReader and a UnicodeEscapeWriter that might fit into [io].

For the reader, any Unicode escape sequence would be transformed into the corresponding Unicode character, or ignored if the sequence is not valid.
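
To make the reader idea concrete, here is a rough sketch of what it could look like. This is only an illustration, not an existing [io] class: the use of PushbackReader, the unescape() helper, and the reading of "ignored" as "passed through unchanged" are all my assumptions.

```java
import java.io.FilterReader;
import java.io.IOException;
import java.io.PushbackReader;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;

// Hypothetical sketch: decodes backslash + 'u' + four hex digits on the fly.
class UnicodeUnescapeReader extends FilterReader {

    UnicodeUnescapeReader(Reader in) {
        // Pushback room for the 5 chars after the backslash: 'u' + 4 hex digits.
        super(new PushbackReader(in, 5));
    }

    @Override
    public int read() throws IOException {
        PushbackReader pin = (PushbackReader) in;
        int c = pin.read();
        if (c != '\\') {
            return c; // ordinary character or end of stream
        }
        // Look ahead at up to 5 characters following the backslash.
        char[] buf = new char[5];
        int n = 0;
        while (n < 5) {
            int r = pin.read();
            if (r == -1) break;
            buf[n++] = (char) r;
        }
        if (n == 5 && buf[0] == 'u') {
            int value = 0;
            boolean valid = true;
            for (int i = 1; i < 5 && valid; i++) {
                int d = hex(buf[i]);
                valid = d >= 0;
                value = value * 16 + d;
            }
            if (valid) return value;
        }
        // Assumption: an invalid sequence is left untouched in the output.
        if (n > 0) pin.unread(buf, 0, n);
        return '\\';
    }

    @Override
    public int read(char[] cbuf, int off, int len) throws IOException {
        if (len == 0) return 0;
        int count = 0;
        while (count < len) {
            int c = read();
            if (c == -1) break;
            cbuf[off + count++] = (char) c;
        }
        return count == 0 ? -1 : count;
    }

    // Returns the value of an ASCII hex digit, or -1 if it is not one.
    private static int hex(char ch) {
        if (ch >= '0' && ch <= '9') return ch - '0';
        if (ch >= 'a' && ch <= 'f') return ch - 'a' + 10;
        if (ch >= 'A' && ch <= 'F') return ch - 'A' + 10;
        return -1;
    }

    // Convenience helper for trying the reader on a String (illustration only).
    static String unescape(String s) {
        StringBuilder sb = new StringBuilder();
        try (Reader r = new UnicodeUnescapeReader(new StringReader(s))) {
            int c;
            while ((c = r.read()) != -1) sb.append((char) c);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return sb.toString();
    }
}
```

The sketch deliberately skips some details: it does not treat a doubled backslash in front of the 'u' as an escaped backslash (which the properties format would require), it does not accept repeated 'u' characters, and skip() is not overridden.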

For the writer, a target charset would be specified in the constructor, and any character not supported by this charset would be turned into \uxxxx.
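
A similar rough sketch for the writer side, again purely illustrative: the constructor shape follows the description above, but relying on CharsetEncoder.canEncode(char) to detect unsupported characters, the lowercase hex output, and the escape() helper are assumptions of mine.

```java
import java.io.FilterWriter;
import java.io.IOException;
import java.io.StringWriter;
import java.io.UncheckedIOException;
import java.io.Writer;
import java.nio.charset.Charset;
import java.nio.charset.CharsetEncoder;

// Hypothetical sketch: escapes characters the target charset cannot encode.
class UnicodeEscapeWriter extends FilterWriter {

    private final CharsetEncoder encoder;

    UnicodeEscapeWriter(Writer out, Charset charset) {
        super(out);
        this.encoder = charset.newEncoder();
    }

    @Override
    public void write(int c) throws IOException {
        char ch = (char) c;
        if (encoder.canEncode(ch)) {
            out.write(ch);
        } else {
            // Backslash, 'u', then the UTF-16 code unit as four hex digits.
            out.write(String.format("\\u%04x", (int) ch));
        }
    }

    @Override
    public void write(char[] cbuf, int off, int len) throws IOException {
        for (int i = off; i < off + len; i++) {
            write(cbuf[i]);
        }
    }

    @Override
    public void write(String str, int off, int len) throws IOException {
        for (int i = off; i < off + len; i++) {
            write(str.charAt(i));
        }
    }

    // Convenience helper for trying the writer on a String (illustration only).
    static String escape(String s, Charset charset) {
        StringWriter sw = new StringWriter();
        try (Writer w = new UnicodeEscapeWriter(sw, charset)) {
            w.write(s);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return sw.toString();
    }
}
```

Because the check is done one char at a time, each half of a surrogate pair is escaped separately, even for a charset such as UTF-8 that could encode the pair directly. That is harmless for the ISO-8859-1 and US-ASCII cases, but a real implementation would probably want to test code points instead.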


What do you think?

Emmanuel Bourg


[io] Unicode escape/unescape Writer/Reader
