Antoine Pitrou added the comment:

Yes, I think 3.3 is correct here. I think it was Benjamin who fixed/improved 
the behaviour of casing methods. Compare 3.3:

>>> "ß".upper()
'SS'

with 3.2:

>>> "ß".upper()
'ß'

Also, 3.2 loses information:

>>> "KİTAP".lower().upper()
'KITAP'
>>> ascii("KİTAP".lower().upper())
"'KITAP'"

while 3.3 retains it:

>>> "KİTAP".lower().upper()
'KİTAP'
>>> ascii("KİTAP".lower().upper())
"'KI\\u0307TAP'"

You can get the combined form again with unicodedata.normalize:

>>> unicodedata.normalize("NFC", "KİTAP".lower().upper())
'KİTAP'

----------
nosy: +benjamin.peterson, haypo, lemburg, pitrou
resolution:  -> invalid
status: open -> closed

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue17252>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

Reply via email to