bug#22838: New 'Binary file' detection considered harmful

Hans Pelleboer Mon, 29 Feb 2016 20:03:53 -0800

On 03/01/2016 12:55 AM, Eric Blake wrote:

I _think_ the Austin Group is leaning towards requiring the "C" localeto always be a unibyte locale with all 256 bytes as valid characters,so neither strict 7-bit ASCII nor UTF-8 would be usable as the "C"locale; but for that to happen, POSIX would also need to allow a wayto get a UTF-8 locale easily accessible and

You do realize that this leaves all _non-US_users_, who rely ondiacritics or even different character sets entirely

for their language, completely out in the cold.

describe how it differs from the "C" locale under such a ruling. Butit's still all conjecture on what the final results will be - even inthe standards committee, gracefully documenting how locale cornercases must behave vs. leaving implementations some latitude is trickybusiness; and any such change is at least 3 or 4 years down the roadbefore it could be standardized in Issue 8 (right now, the focus is onTechnical Corrigendum 2 for Issue 7).

Already back in _1987_, an IT professor in Leiden was especiallyappointed for the streamlining ofall the competing character sets that later were merged to becomeUnicode. Given the currentstate of affairs, nearly thirty years down the road, I do not share youroptimism that this issue

will be resolved in the next couple of years.

Hans Pelleboer

bug#22838: New 'Binary file' detection considered harmful

Reply via email to