> * Unicode isn't necessarily universal, or might stop to be so in future.
> If a character is not representable in Unicode, and you chose to use
> Unicode for everything, you're screwed

There are provision for private usage codepoints.

> * related to the previous point, some other character encodings might
> not have a lossless round-trip conversion.

Did we need that? The intention is that strings are stored in the
format wanted and not recoded without a good reason.

>> need a translation table. The only point to solve is we need some
>> special way to work with fixed-8 with no intended character
>> representation.
> Introducing the "no character set" character set is just a special case
> of arbitrary character sets. I see no point in using the special case
> over the generic one.

Because is special, and we need to deal with his speciality in any
case. Just concatenating it with any other is plain wrong. Just
treating it as iso-8859-1 is not taken in as plain binary at all.

But the main point is that the encoding issues is complicated enough
even inside unicode, and adding another layer of complexity will make
it worse.

-- 
Salu2

Reply via email to