[issue29456] bugs in unicodedata.normalize: u1176, u11a7 and u11c3
Wonsup Yoon added the comment: Any updates? I need this fix for my project. -- ___ Python tracker <http://bugs.python.org/issue29456> ___ ___ Python-bugs-list mailing list Unsubscribe: https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue29456] bugs in unicodedata.normalize: u1176, u11a7 and u11c3
Wonsup Yoon added the comment: I added some test cases for this issue. Please, someone check this. -- ___ Python tracker <http://bugs.python.org/issue29456> ___ ___ Python-bugs-list mailing list Unsubscribe: https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue29456] bugs in unicodedata.normalize: u1176, u11a7 and u11c3
Wonsup Yoon added the comment: I think it can be merged. Is there anything I need to do? -- ___ Python tracker <http://bugs.python.org/issue29456> ___ ___ Python-bugs-list mailing list Unsubscribe: https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue29456] bugs in unicodedata.normalize: u1176, u11a7 and u11c3
Wonsup Yoon added the comment: This patch fixes changes in Unicode 4.1.0. I think it well reviewed and it is time to merge. Who can commit this patch? @animalize says: Let me give a supplement: Before Unicode 4.1.0 (draft), here is: TBase <= code <= TBase+TCount see: http://www.unicode.org/reports/tr15/tr15-24.html#hangul_composition After Unicode 4.1.0, here is TBase < code < TBase+TCount, which in line with the latest version (Unicode 10.0) see: http://www.unicode.org/reports/tr15/tr15-25.html#hangul_composition This change happened in 2005. -- ___ Python tracker <http://bugs.python.org/issue29456> ___ ___ Python-bugs-list mailing list Unsubscribe: https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue29456] bugs in unicodedata.normalize: u1176, u11a7 and u11c3
Wonsup Yoon added the comment: Hello? -- ___ Python tracker <http://bugs.python.org/issue29456> ___ ___ Python-bugs-list mailing list Unsubscribe: https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue29456] bug in unicodedata.normalize: u1176, u11a7 and u11c3
Changes by Wonsup Yoon : -- title: bug in unicodedata.normalize: u1176 -> bug in unicodedata.normalize: u1176, u11a7 and u11c3 ___ Python tracker <http://bugs.python.org/issue29456> ___ ___ Python-bugs-list mailing list Unsubscribe: https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue29456] bug in unicodedata.normalize: u1176, u11a7 and u11c3
Wonsup Yoon added the comment: Is there anything need more? -- ___ Python tracker <http://bugs.python.org/issue29456> ___ ___ Python-bugs-list mailing list Unsubscribe: https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue29456] bug in unicodedata.normalize: u1176, u11a7 and u11c3
Wonsup Yoon added the comment: Ok, I'll do it. -- ___ Python tracker <http://bugs.python.org/issue29456> ___ ___ Python-bugs-list mailing list Unsubscribe: https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue29456] bugs in unicodedata.normalize: u1176, u11a7 and u11c3
Changes by Wonsup Yoon : -- title: bug in unicodedata.normalize: u1176, u11a7 and u11c3 -> bugs in unicodedata.normalize: u1176, u11a7 and u11c3 ___ Python tracker <http://bugs.python.org/issue29456> ___ ___ Python-bugs-list mailing list Unsubscribe: https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue29456] bugs in unicodedata.normalize: u1176, u11a7 and u11c3
Changes by Wonsup Yoon : -- pull_requests: +2029 ___ Python tracker <http://bugs.python.org/issue29456> ___ ___ Python-bugs-list mailing list Unsubscribe: https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue29456] bugs in unicodedata.normalize: u1176, u11a7 and u11c3
Wonsup Yoon added the comment: Hello! -- ___ Python tracker <https://bugs.python.org/issue29456> ___ ___ Python-bugs-list mailing list Unsubscribe: https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue29456] bug in unicodedata.normalize: u1176
New submission from Wonsup Yoon: unicodedata can't normalize(NFC) hangul strings which contain \u1176(HANGUL JUNGSEONG A-O). >>> from unicodedata import normalize >>> normalize("NFC", "\u1100\u1176\u11a8") '깍' => should be "\u1100\u1176\u11a8" not '깍' (\uae4d) I attached a patch for this issue. (Fixing boundary of modern medial vowels) -- components: Unicode files: u1176.patch keywords: patch messages: 287077 nosy: ezio.melotti, haypo, pusnow priority: normal severity: normal status: open title: bug in unicodedata.normalize: u1176 versions: Python 2.7, Python 3.6 Added file: http://bugs.python.org/file46535/u1176.patch ___ Python tracker <http://bugs.python.org/issue29456> ___ ___ Python-bugs-list mailing list Unsubscribe: https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue29456] bug in unicodedata.normalize: u1176
Wonsup Yoon added the comment: I think you are right. The modern final consonants is [11a8..11c2]. I attached another patch for this issue. -- Added file: http://bugs.python.org/file46536/u11a7u11c3.patch ___ Python tracker <http://bugs.python.org/issue29456> ___ ___ Python-bugs-list mailing list Unsubscribe: https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com