On ma, 2007-06-18 at 13:37 +0100, Pierre Habouzit wrote: > On Mon, Jun 18, 2007 at 10:48:04AM +0100, Pierre Habouzit wrote: > > multi-byte one would be really really bad (as you would end up with e.g. > > strings split in the middle of a point code, *brrr* you definitely don't > > want that). > > I wasn't clear it seems, but what I mean is if a programs assumes he's > dealing with ascii,
This buggy assumption seems to happen in every locale, not just C.UTF-8, and in every other case we treat it as a bug. Is there a standard that says every C.* locale must have the same single byte character set as the plain C locale? (Incidentally, the standard for the C language does not require the character set in the C locale to be ASCII; EBCDIC, for example, works as well. See 5.2.1, "Character sets", for the full description. You might be able to find a copy of the standard by searching for ISO/IEC 9899.) -- Mulla on halu häkätä ja mulla on siihen taito