On 15/11/2021 17:53, Pushkar N Kulkarni wrote:
Hi there,


OpenJDK currently supports version 2000 of the GB18030 
(https://en.wikipedia.org/wiki/GB_18030) character set viz. GB18030-2000. The 
character mappings corresponding to Unicode codepoints '\u1E3F' and '\uE7C7' 
were swapped in a new version of the character set named GB18030-2005. I learn 
that this corrected a mistake in version 2000.

OpenJDK does not support version 2005 as yet. Can someone please help me with 
reasons for the same, if any?

We do have users requesting for 2005 support. While Linux (RHEL 7/8) has moved 
to supporting GB18030-2005 via glibc, Windows 10 and AIX 7.2 still have 
GB18030-2000 base. That means OpenJDK cannot move to GB18030-2005 base as yet. 
However, we can support both the versions until all the supported platforms 
move to GB18030-2005 base. Would that be an acceptable proposition?

If we can have an enhancement request opened, I'd be glad to contribute the 
GB18030-2005 charset implementation.

If I read this correctly, then your proposal is for GB18030 to continue to be GB18030-2000 but you would introduce a new charset GB18030 map to GB18030-2005 for the new version. Are you also proposing a system property or some means to have GB18030 be GB18030-2005 until the time is right to make it the default?

-Alan

Reply via email to