New submission from Mark Tolonen <metol...@hotmail.com>: This is on Windows 7 SP1. Run 'chcp 65001' then Python from a console. Note the extra characters when non-ASCII characters are in the string. At a guess it appears to be using the UTF-8 byte length of the internal representation instead of the character count.
Python 3.3.0a1 (default, Mar 4 2012, 17:27:59) [MSC v.1500 32 bit (Intel)] on win32 Type "help", "copyright", "credits" or "license" for more information. >>> print('hello') hello >>> print('p\u012bny\u012bn') pīnyīn n >>> print('\u012b'*10) īīīīīīīīīī �īīīī �ī ---------- components: IO, Unicode, Windows messages: 155149 nosy: ezio.melotti, metolone priority: normal severity: normal status: open title: console w/ cp65001 displays extra characters for non-ascii strings. type: behavior versions: Python 3.3 _______________________________________ Python tracker <rep...@bugs.python.org> <http://bugs.python.org/issue14227> _______________________________________ _______________________________________________ Python-bugs-list mailing list Unsubscribe: http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com