Hi,

It seem that unescaping unicode escape sequences (\u1234) in input stream is a common need. [configuration] does it for PropertiesConfiguration, and [csv] can also decode these sequences optionally.

In the other direction, there is also a need to escape unicode characters not supported by a given encoding when writing (see CONFIGURATION-457).

I think these features could be implemented as a UnicodeUnescapeReader and a UnicodeEscapeWriter that might fit into [io].

For the reader, any unicode escape sequence would be transformed into the corresponding unicode character, or ignored if the sequence is not valid.

For the writer, a target charset would be specified in the constructor, and any character not supported by this charset would be turned into \uxxxx.

What do you think?

Emmanuel Bourg

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org
For additional commands, e-mail: dev-h...@commons.apache.org

Reply via email to