Ma Lin added the comment:

Andre Lemburg,
No change is needed. A844 exists in GBK but not in GB2312, so there is no reason to add it to GB2312.

Your logic is right: it is hard to judge which mapping is wrong. But U+30FB (・ KATAKANA MIDDLE DOT) and U+2015 (― HORIZONTAL BAR) look out of place among these common Chinese punctuation symbols:

A1A2-A1B7: 、 。 ・ ˉ ˇ ¨ 〃 々 ― ~ ‖ … ‘ ’ “ ” 〔 〕 〈 〉 《 》

If they were U+00B7 (· MIDDLE DOT) and U+2014 (— EM DASH), this section would look more reasonable. GB2312 was published in the early 1980s, so this looks like a historical accident.

Luckily, most programming languages are on the same side.

----------

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue24036>
_______________________________________
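
For anyone who wants to check the two disputed positions locally, here is a minimal sketch that decodes A1A4 and A1AA (plus the GBK-only A844) with CPython's built-in `gb2312` and `gbk` codecs. The helper names `decode_or_none`, `describe` and `compare` are made up for this illustration and are not part of any library.

```python
import unicodedata

# Byte sequences for the positions discussed in this message.
POSITIONS = {
    "A1A4": b"\xa1\xa4",
    "A1AA": b"\xa1\xaa",
    "A844": b"\xa8\x44",  # defined in GBK, outside the GB2312 range
}


def decode_or_none(raw, codec):
    """Decode raw bytes with the given codec; return None if it cannot decode."""
    try:
        return raw.decode(codec)
    except UnicodeDecodeError:
        return None


def describe(ch):
    """Format a decoded character as 'U+XXXX NAME' for display."""
    if ch is None:
        return "(not decodable)"
    return f"U+{ord(ch):04X} {unicodedata.name(ch, '<unnamed>')}"


def compare(codecs=("gb2312", "gbk")):
    """Map each position label to the result of decoding it with each codec."""
    return {
        label: {codec: decode_or_none(raw, codec) for codec in codecs}
        for label, raw in POSITIONS.items()
    }


if __name__ == "__main__":
    for label, by_codec in compare().items():
        for codec, ch in by_codec.items():
            print(f"{label} via {codec}: {describe(ch)}")
```

If the codecs behave as described in this issue, `gb2312` should give U+30FB and U+2015 for A1A4 and A1AA, while `gbk` should give U+00B7 and U+2014 and map A844 to U+2015.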