[issue36671] str.lower() looses character information when working with UTF-8

2019-04-20 Thread SilentGhost
SilentGhost added the comment:
This is the behaviour specified by the Unicode standard, version 11. It is not an oversight on the part of the CPython implementation: this character (among others) lowercases to two characters.
--
nosy: +SilentGhost
resolution: -> not a bug
stage: -> resolv
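The point in the comment above can be checked directly. The following is a minimal sketch, not part of the original thread, that lists the code points produced by lowercasing U+0130; the variable names are illustrative only.

```python
import unicodedata

# U+0130 is LATIN CAPITAL LETTER I WITH DOT ABOVE
upper_i_dot = "\u0130"

# str.lower() applies the full Unicode case mapping, which for this
# character yields two code points rather than one
lowered = upper_i_dot.lower()

code_points = [f"U+{ord(ch):04X}" for ch in lowered]
names = [unicodedata.name(ch) for ch in lowered]

print(len(lowered))
print(code_points)
print(names)
```

The second code point is a combining mark, so most terminals draw the pair as a single glyph even though len() reports 2.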

[issue36671] str.lower() looses character information when working with UTF-8

2019-04-20 Thread Kadam Parikh
New submission from Kadam Parikh:
When converting a particular UTF-8 character, "İ", to lowercase, str.lower() doesn't behave correctly. It returns two lowercase characters instead of one, which is not the desired result.
Code:
>>> print("\u0130")
İ
>>> print("\u0130".lower())
i̇
>>>
--
components: Uni
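On the question of lost information raised in the report, a short sketch (again not from the thread, with illustrative variable names) suggests the original character can be recovered: uppercasing the two-character result and applying NFC normalization should give back U+0130.

```python
import unicodedata

original = "\u0130"

# Lowercasing gives "i" followed by COMBINING DOT ABOVE
lowered = original.lower()

# Uppercasing keeps the combining mark: "I" followed by U+0307
re_uppered = lowered.upper()

# NFC composes "I" + U+0307 back into the single code point U+0130
round_trip = unicodedata.normalize("NFC", re_uppered)

print(round_trip == original)
```

This only covers the isolated character; text that mixes it with other combining marks may need separate checks.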