I have not been able to identify a defect in the scheme specified for UTF-16 to UTF-8.
I have pointed to implementations that are sometimes unsuccessful, and their failures have some common characteristics. For now, I avoid UTF-8 when I can. I expect that it will be problem-free at some not at all remote time in the future. I certainly was not prescient enough to think so ten years ago, but I now a little regret the availability of UTF-8. Its unsuitability for use with non-alphabetic text or with mixed 'alphabetic' and non-alphabetic text, like written Japanese] has produced a sharp difference in Eastern and Western Unicode usage patterns that is at best unfortunate. John Gilmore, Ashland, MA 01721 - USA ---------------------------------------------------------------------- For IBM-MAIN subscribe / signoff / archive access instructions, send email to [email protected] with the message: INFO IBM-MAIN
