At Wed,  5 Jun 2002 05:38:38 -0400 (EDT),
Roland McGrath wrote:
> I was (and am still sort of) unclear on UTF-32 vs UCS-4.

Their formats are identical but their semantics is different, because
UCS-4 is defined in ISO/IEC 10646, while UTF-32 is defined in Unicode
Standard.

Basically ISO/IEC 10646 is a standard only for characters (in normal
sense), so you may not represent invalid characters in UCS-4
(e.g. 0xFFFF).

On the other hand, UTF-32 disallows you to use any code value that
cannot be coded in UTF-16, that is, you can use only the range
[0x00000000, 0x0010FFFF].

There are some other differences, but, in brief, you can assume that
they are the same, as long as you use them as internal encodings.

Cheers,
Okuji

_______________________________________________
Bug-hurd mailing list
[EMAIL PROTECTED]
http://mail.gnu.org/mailman/listinfo/bug-hurd

Reply via email to