On Mon, Apr 22, 2002 at 11:13:47PM -0400, Gaute B Strokkenes wrote:
> Any Unicode character will fit in 21 bits, so you have plenty of bits
> left over for attirbutes of various sorts.  Furthermore, the Unicode
> standard guarantees that this will alwasy be so.

Is this also true for UCS-4?  Because that is what wchar_t is on GNU
systems.  It would be nice if you could verify that.

I feel a bit uncomfortable with overloading the wchar this way, because it
does not allow you to run iconv_t on the mapped memory, you have to copy it
first.  But it might be an option.
 
> They yucky bit is when you consider characters with width not equal to
> one.  CJK ideographs are twice as wide as Latin charactes, and
> composing characters generally occupy the same cell as the base
> character.  This breaks the model with one character per cell,
> unfortunately.

We won't support that for this very reason in the first version of the
console.  Like on GNU/Linux, we will focus on Unicode support level 1 and
normalization form C.  (Hope I got this right, I am citing these from memory
:)

Thanks,
Marcus

-- 
`Rhubarb is no Egyptian god.' Debian http://www.debian.org [EMAIL PROTECTED]
Marcus Brinkmann              GNU    http://www.gnu.org    [EMAIL PROTECTED]
[EMAIL PROTECTED]
http://www.marcus-brinkmann.de

_______________________________________________
Bug-hurd mailing list
[EMAIL PROTECTED]
http://mail.gnu.org/mailman/listinfo/bug-hurd

Reply via email to