On Mon, Jun 05, 2017 at 12:32:02AM +0100, foss.freedom wrote:
> Not quite sure I understand why since I dont recognise that locale -
> some-sort of special "C language" unicode locale? - but it works so
> thats the main thing.

C.UTF-8 is a locale that's same as C except for assigning a meaning to bytes
above 127 and declaring what wide-char functions do -- both of which are
undefined in C.

On Debian it's guaranteed to be available no matter what packages you have
installed or configured.

It's even a good choice for regular human use: en_US.UTF-8 in glibc for
example has pants-on-the-head sanity-challenged collation:

0 9
0.9.0
0.9.0-a0-foo-bar
({---=[ 0.9.0-a11 ]=---})
0.9.0-a17-quux
(0.9.0-a2)
0.9.0+a99-1
0.9.0-rc1
0.9.1
0 9 9
({---=[ 0.9-a11 ]=---})
0.9 ab

while other systems, and C.UTF-8, do:

(0.9.0-a2)
({---=[ 0.9.0-a11 ]=---})
({---=[ 0.9-a11 ]=---})
0 9
0 9 9
0.9 ab
0.9.0
0.9.0+a99-1
0.9.0-a0-foo-bar
0.9.0-a17-quux
0.9.0-rc1
0.9.1

Besides the debatable choice of case-sensitivity, this works for all
languages which match the Unicode order, such as English and Russian, but
not Polish were a < ą < b.


-- 
⢀⣴⠾⠻⢶⣦⠀ A tit a day keeps the vet away.
⣾⠁⢰⠒⠀⣿⡁
⢿⡄⠘⠷⠚⠋⠀ (Rejoice as my small-animal-murder-machine got unbroken after
⠈⠳⣄⠀⠀⠀⠀ nearly two years of no catch!)

Reply via email to