Ma Lin <malin...@163.com> added the comment:
> But how many new Python web application use CJK codec instead of UTF-8? A CJK character usually takes 2-bytes in CJK encodings, but takes 3-bytes in UTF-8. I tested a Chinese book: in GBK: 853,025 bytes in UTF-8: 1,267,523 bytes For CJK content, UTF-8 is wasteful, maybe CJK encodings will not be eliminated. ---------- _______________________________________ Python tracker <rep...@bugs.python.org> <https://bugs.python.org/issue41330> _______________________________________ _______________________________________________ Python-bugs-list mailing list Unsubscribe: https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com