I have not been able to identify a defect in the scheme specified for
UTF-16 to UTF-8.

I have pointed to implementations that are sometimes unsuccessful, and
their failures have some common characteristics.

For now, I avoid UTF-8 when I can.  I expect that it will be
problem-free at some not at all remote time in the future.

I certainly was not prescient enough to think so ten years ago, but I
now a little regret the availability of UTF-8.   Its unsuitability for
use with non-alphabetic text or with  mixed  'alphabetic' and
non-alphabetic text, like written Japanese] has produced a sharp
difference in Eastern and Western Unicode usage patterns that is at
best unfortunate.

John Gilmore, Ashland, MA 01721 - USA

----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [email protected] with the message: INFO IBM-MAIN

Reply via email to