2008/11/11 Michael Schnell <[EMAIL PROTECTED]>:
>
>> a) "ü": "LATIN SMALL LETTER U WITH DIAERESIS", encoded as $C3 $BC
>> b) "ü": "LATIN SMALL LETTER U", encoded as $75, followed by "COMBINING
>> DIAERESIS", which is encoded as $CC $88
>
> I see, but I fail to see the sense of providing two different UTF8 code
> variants for the same unicode character.

I believe it's got to do with case conversion, sorting etc. As far as I
know, Mac OS (HFS+ file names) always uses the decomposed form and Linux
usually the precomposed one. Both are valid normalization forms (NFD and
NFC respectively). But yes, those combining characters are a pain to
handle. I tried in fpGUI Toolkit and am still having issues with things
like cursor position calculations. :-(
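
To illustrate the two encodings and the cursor problem, here is a rough
sketch in Python (not fpGUI code; `cursor_stops` is a made-up helper name,
and it only skips combining marks rather than doing full grapheme cluster
segmentation):

```python
import unicodedata

precomposed = "\u00fc"   # LATIN SMALL LETTER U WITH DIAERESIS
decomposed = "u\u0308"   # LATIN SMALL LETTER U + COMBINING DIAERESIS

# The two spellings differ at the byte level and compare as unequal ...
assert precomposed.encode("utf-8") == b"\xc3\xbc"
assert decomposed.encode("utf-8") == b"u\xcc\x88"
assert precomposed != decomposed

# ... but normalizing both to the same form makes them comparable.
assert unicodedata.normalize("NFC", decomposed) == precomposed
assert unicodedata.normalize("NFD", precomposed) == decomposed


def cursor_stops(text):
    """Return the code point indexes where a text cursor may rest.

    Simplifying assumption: a stop is allowed before every code point that
    is not a combining mark, plus one stop at the end of the string.
    """
    stops = [i for i, ch in enumerate(text) if unicodedata.combining(ch) == 0]
    stops.append(len(text))
    return stops
```

With this, cursor_stops("f\u00fcr") and cursor_stops("fu\u0308r") both
yield four stops, so the cursor never lands between the "u" and its
diaeresis, even though the second string is one code point longer.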


Regards,
  - Graeme -


_______________________________________________
fpGUI - a cross-platform Free Pascal GUI toolkit
http://opensoft.homeip.net/fpgui/
_______________________________________________
fpc-devel maillist  -  [email protected]
http://lists.freepascal.org/mailman/listinfo/fpc-devel
