[issue11461] Reading UTF-16 with codecs.readline() breaks on surrogate pairs

2012-09-26 Thread STINNER Victor
Changes by STINNER Victor : -- versions: +Python 3.2, Python 3.3, Python 3.4 ___ Python tracker ___ ___ Python-bugs-list mailing list

[issue11461] Reading UTF-16 with codecs.readline() breaks on surrogate pairs

2012-09-26 Thread Ezio Melotti
Changes by Ezio Melotti : -- stage: test needed -> patch review ___ Python tracker ___ ___ Python-bugs-list mailing list Unsubscribe:

[issue11461] Reading UTF-16 with codecs.readline() breaks on surrogate pairs

2011-03-10 Thread Antoine Pitrou
Changes by Antoine Pitrou : -- nosy: +haypo ___ Python tracker ___ ___ Python-bugs-list mailing list Unsubscribe: http://mail.python.

[issue11461] Reading UTF-16 with codecs.readline() breaks on surrogate pairs

2011-03-10 Thread Amaury Forgeot d'Arc
Amaury Forgeot d'Arc added the comment: The utf16 incremental codec does not like incomplete surrogate pairs. Patch attached. I also plan to refactor all the test_partial() functions of test_codecs, to give them a common implementation. -- keywords: +patch nosy: +amaury.forgeotdarc Add

[issue11461] Reading UTF-16 with codecs.readline() breaks on surrogate pairs

2011-03-10 Thread Ezio Melotti
Changes by Ezio Melotti : -- nosy: +ezio.melotti stage: -> test needed ___ Python tracker ___ ___ Python-bugs-list mailing list Unsub

[issue11461] Reading UTF-16 with codecs.readline() breaks on surrogate pairs

2011-03-10 Thread Yuriy Pilgun
New submission from Yuriy Pilgun : Reading UTF-16 text file with module 'codecs' fails, if surrogate pair is located at 72-character boundary. Attached python script fails with message: UnicodeDecodeError: 'utf16' codec can't decode bytes in position 70-71: unexpected end of data The reason i