Hi,
It seem that unescaping unicode escape sequences (\u1234) in input
stream is a common need. [configuration] does it for
PropertiesConfiguration, and [csv] can also decode these sequences
optionally.
In the other direction, there is also a need to escape unicode
characters not supported by a given encoding when writing (see
CONFIGURATION-457).
I think these features could be implemented as a UnicodeUnescapeReader
and a UnicodeEscapeWriter that might fit into [io].
For the reader, any unicode escape sequence would be transformed into
the corresponding unicode character, or ignored if the sequence is not
valid.
For the writer, a target charset would be specified in the constructor,
and any character not supported by this charset would be turned into \uxxxx.
What do you think?
Emmanuel Bourg
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org
For additional commands, e-mail: dev-h...@commons.apache.org