[issue4971] Incorrect title case

2009-04-25 Thread Martin v. Löwis
Martin v. Löwis added the comment: In r71894, makeunicodedata.py was fixed to correctly encode titlecase in the unicodectype database (see issue5828). In r71947, r71948, r71949, and r71950, this issue is fixed by no longer having titlecase fall back to uppercase at run time. -- resolution

[issue4971] Incorrect title case

2009-01-17 Thread Martin v. Löwis
Martin v. Löwis added the comment:
> Perhaps we are looking at different files;
Indeed, I was looking at the 3.2.0 database (assuming that it would be the same in subsequent versions).
> ISTM the problem is that implementing the default-to-uppercase was not
> done in Tools/unicode/makeunicode

[issue4971] Incorrect title case

2009-01-17 Thread John Machin
John Machin added the comment: Martin: """Considering this note, the simple titlecase of U+01C5 *is* U+01C4: the titlecase value is omitted, hence it is the same as uppercase, hence it is U+01C4.""" Perhaps we are looking at different files; in the Unicode 5.1 UnicodeData.txt that I downloaded (

[issue4971] Incorrect title case

2009-01-17 Thread Martin v. Löwis
Martin v. Löwis added the comment: I do think this is a bug in the Unicode database. The current approach (of falling back to uppercase if there is no title case in the Unicode database) goes back to r17708. However, even the prior version only contained explicitly the cases where a titlecase wa
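The fallback rule discussed in this comment can be sketched as follows. This is only an illustration, not the code from r17708 or from makeunicodedata.py: the function name `simple_titlecase` and its parameters are hypothetical, and the sketch assumes each record carries an optional uppercase and an optional titlecase code point, with a missing value represented as `None`.

```python
from typing import Optional


def simple_titlecase(code_point: int,
                     upper: Optional[int],
                     title: Optional[int]) -> int:
    """Hypothetical helper: pick a titlecase mapping for one record.

    An explicit titlecase value wins; if it is missing, fall back to the
    uppercase value; if that is missing too, the character maps to itself.
    """
    if title is not None:
        return title
    if upper is not None:
        return upper
    return code_point


# With an explicit titlecase value, the character keeps its own mapping.
print(hex(simple_titlecase(0x01C5, upper=0x01C4, title=0x01C5)))
# With the titlecase value missing, the uppercase value is used instead.
print(hex(simple_titlecase(0x01C5, upper=0x01C4, title=None)))
```

The second call shows why the outcome depends on whether the titlecase value is actually present in the data the tool reads: if it is dropped or absent, U+01C5 ends up mapped to U+01C4.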

[issue4971] Incorrect title case

2009-01-17 Thread Martin v. Löwis
Changes by Martin v. Löwis: -- versions: -Python 2.4, Python 2.5

[issue4971] Incorrect title case

2009-01-17 Thread Matthew Barnett
New submission from Matthew Barnett: I've found that the following 4 Unicode characters/codepoints don't behave as I'd expect: Dž (U+01C5), Lj (U+01C8), Nj (U+01CB), Dz (U+01F2). For example, u'\u01C5'.istitle() returns True and unicodedata.category(u'\u01C5') returns 'Lt', but u'\u01C5'.title() re
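A minimal sketch for checking the four characters from this report on a given interpreter. It is written with Python 3 string literals rather than the `u'...'` form used in the report, and the helper name `describe` is made up for illustration. On an interpreter that includes the fix described in this thread, `title_is_identity` would be expected to be True for all four.

```python
import unicodedata

# The four titlecase digraphs named in the report.
DIGRAPHS = ["\u01C5", "\u01C8", "\u01CB", "\u01F2"]


def describe(ch):
    """Collect the properties the report compares for one character."""
    return {
        "code_point": "U+%04X" % ord(ch),
        "category": unicodedata.category(ch),
        "istitle": ch.istitle(),
        # True when title() leaves the character unchanged.
        "title_is_identity": ch.title() == ch,
        # True when title() gives the same result as upper().
        "title_equals_upper": ch.title() == ch.upper(),
    }


for ch in DIGRAPHS:
    print(describe(ch))
```

Comparing `title_is_identity` with `title_equals_upper` separates the two behaviours discussed in the thread: a titlecase letter that maps to itself versus one that falls back to its uppercase form.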