Re: Codereview request for 7082884: Incorrect UTF8 conversion for sequence ED 31

2011-09-28 Thread Ulf Zibis
Am 19.09.2011 22:21, schrieb Xueming Shen: The current implementation decode new String(new byte[]{(byte)0xed, 31}, "UTF8") Bug 7082884 refers to ED 31, so it should be: new String(new byte[]{(byte)0xed, 0x31}, "UTF8") -Ulf

Codereview request for 7082884: Incorrect UTF8 conversion for sequence ED 31

2011-09-19 Thread Xueming Shen
Hi, Unicode Standard added "Addition Constraints on conversion of ill-formed UTF-8" in version 5.1 [1] and updated in 6.0 again with further "clarification" [2] regarding how a "conformance" implementation should handle ill-formed UTF-8 byte sequence. Basically it says (1) the conversion pro