On Tue, Apr 07, 2009 at 09:00:50PM +0000, Thorsten Glaser wrote: > Adeodato Simó dixit: > > >I would go as far as suggesting that some package like libc6 itself > > FWIW: > > -rw-r--r-- 1 tg tg 238336 Apr 7 22:59 en_US.UTF-8/LC_CTYPE > > It's not *that* much... > > >Finally, this stuff that Roger proposes about making “C” be UTF-8, and > >create some C.ASCII for people needing that, sounds shocking at the same > >time as appealing. > > It won't work, because in a UTF-8 locale, for example stdio > functions must reject "invalid" (not valid UTF-8) input, so > it would not be 8-bit clean/transparent any more.
I wasn't aware that this level of checking was performed, though it does make sense. But, does it not reject non 7-bit input in the C locale for completeness? Should tools doing "raw" I/O not be using lower level interfaces such as fread() and fwrite() rather than the "formatted" print functions which are specified to behave in a locale-dependent manner? This strikes me as bugs in the form of assumptions in the code which should be fixed, rather than a fundamental problem with the locale itself using a non-7-bit-ASCII codeset. Thanks, Roger -- .''`. Roger Leigh : :' : Debian GNU/Linux http://people.debian.org/~rleigh/ `. `' Printing on GNU/Linux? http://gutenprint.sourceforge.net/ `- GPG Public Key: 0x25BFB848 Please GPG sign your mail.
signature.asc
Description: Digital signature