Terry J. Reedy added the comment:

3.3.2:
>>> '\uD80C\uDC80' == '\U00013080'
False

The statement that surrogate code units can be encoded this way is still true. 
Indeed, it is now the only way to get such code units into a string. The 
suggestion that a pair will make an astral char is now false. The sentence 
could be changed to 

"Individual surrogate code units can be encoded using this escape sequence."

On the other hand, the same is true of *any* BMP char, including all the 
*other* non-graphic chars that can only be entered this way. So I think the 
sentence, if not deleted, should be replaced by what seems to me a more useful 
(complete) statement.

"Any Basic Multilingual  Plane (BMP) codepoint can be encoded using this escape 
sequence."

----------
nosy: +terry.reedy

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue18572>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

Reply via email to