On Mon, Jun 05, 2017 at 12:32:02AM +0100, foss.freedom wrote: > Not quite sure I understand why since I dont recognise that locale - > some-sort of special "C language" unicode locale? - but it works so > thats the main thing.
C.UTF-8 is a locale that's same as C except for assigning a meaning to bytes above 127 and declaring what wide-char functions do -- both of which are undefined in C. On Debian it's guaranteed to be available no matter what packages you have installed or configured. It's even a good choice for regular human use: en_US.UTF-8 in glibc for example has pants-on-the-head sanity-challenged collation: 0 9 0.9.0 0.9.0-a0-foo-bar ({---=[ 0.9.0-a11 ]=---}) 0.9.0-a17-quux (0.9.0-a2) 0.9.0+a99-1 0.9.0-rc1 0.9.1 0 9 9 ({---=[ 0.9-a11 ]=---}) 0.9 ab while other systems, and C.UTF-8, do: (0.9.0-a2) ({---=[ 0.9.0-a11 ]=---}) ({---=[ 0.9-a11 ]=---}) 0 9 0 9 9 0.9 ab 0.9.0 0.9.0+a99-1 0.9.0-a0-foo-bar 0.9.0-a17-quux 0.9.0-rc1 0.9.1 Besides the debatable choice of case-sensitivity, this works for all languages which match the Unicode order, such as English and Russian, but not Polish were a < ą < b. -- ⢀⣴⠾⠻⢶⣦⠀ A tit a day keeps the vet away. ⣾⠁⢰⠒⠀⣿⡁ ⢿⡄⠘⠷⠚⠋⠀ (Rejoice as my small-animal-murder-machine got unbroken after ⠈⠳⣄⠀⠀⠀⠀ nearly two years of no catch!)