[issue26917] unicodedata.normalize(): bug in Hangul Composition

Armin Rigo Tue, 03 May 2016 03:29:55 -0700

Armin Rigo added the comment:

See also 
https://bitbucket.org/pypy/pypy/issues/2289/incorrect-unicode-normalization .  
It seems that you reached the same conclusion than the OP in that issue: the 
problem would really be that normalizing "\uafb8\u11a7" should not drop the 
second character.  Both Python and PyPy do that, but Python adds the 
"is_normalized()" check, so in some cases it returns the correct unmodified 
result.


----------

_______________________________________
Python tracker <[email protected]>
<http://bugs.python.org/issue26917>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

[issue26917] unicodedata.normalize(): bug in Hangul Composition

Reply via email to