2008/11/11 Michael Schnell <[EMAIL PROTECTED]>: > >> a) "ü": "LATIN SMALL LETTER U WITH DIAERESIS", encoded as $C3 $BC >> b) "ü": "LATIN SMALL LETTER U", encoded as $75, followed by "COMBINING >> DIAERESIS", which is encoded as $CC $88 > > I see, but I fail to see the sense of providing two different UTF8 code > variants for the same unicode character.
I believe it's got to do with case conversion, sorting etc... I think Mac OS always uses the non-normalized form and Linux the normalized one. But yes, those combining characters are a pain to handle. I tried in fpGUI Toolkit and still having issues with things like cursor position calculations etc.. :-( Regards, - Graeme - _______________________________________________ fpGUI - a cross-platform Free Pascal GUI toolkit http://opensoft.homeip.net/fpgui/
_______________________________________________ fpc-devel maillist - [email protected] http://lists.freepascal.org/mailman/listinfo/fpc-devel
