[issue36789] Unicode HOWTO incorrectly states that UTF-8 contains no zero bytes

2019-05-03 Thread mbiggs
New submission from mbiggs: In the Unicode HOWTO (http://docs.python.org/3.3/howto/unicode.html) it says the following: "UTF-8 has several convenient properties: (...) 2. A Unicode string is turned into a sequence of bytes containing no embedded zero bytes. This avoids byte-ordering i

[issue36789] Unicode HOWTO incorrectly states that UTF-8 contains no zero bytes

2019-05-04 Thread mbiggs
mbiggs added the comment: So a correct statement would be "A UTF-8 string is turned into a sequence of bytes that contains embedded zero bytes only where they represent the NULL character (U+0000)." I think it's important to correct this because the part about processin
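
The corrected statement is easy to check in Python. The following is only a minimal sketch: the helper name `utf8_zero_byte_offsets` is made up for illustration and is not part of the HOWTO or of either pull request.

```python
def utf8_zero_byte_offsets(text):
    """Return the offsets of all zero bytes in the UTF-8 encoding of text."""
    encoded = text.encode("utf-8")
    return [i for i, byte in enumerate(encoded) if byte == 0]


# No U+0000 in the input: no zero bytes, even with non-ASCII characters.
print(utf8_zero_byte_offsets("naïve café ☃"))  # → []

# An embedded U+0000 is encoded as exactly one zero byte.
print(utf8_zero_byte_offsets("a\x00b"))  # → [1]

# For contrast, UTF-16 emits zero bytes even for plain ASCII text.
print("ab".encode("utf-16-le"))  # → b'a\x00b\x00'
```

So zero bytes appear in UTF-8 output only where the input itself contains U+0000, which matters for code that treats a zero byte as a string terminator.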

[issue36789] Unicode HOWTO incorrectly states that UTF-8 contains no zero bytes

2019-05-08 Thread mbiggs
Change by mbiggs: pull_requests: +13102. Python tracker <https://bugs.python.org/issue36789>

[issue36789] Unicode HOWTO incorrectly states that UTF-8 contains no zero bytes

2019-05-08 Thread mbiggs
mbiggs added the comment: Ah, I sent a pull request but didn't realize that redshiftzero already had. Their PR looks good to me. Thanks for fixing this!