Re: [fpc-devel] Unicode support (again)

Jonas Maebe Tue, 11 Nov 2008 04:45:35 -0800


On 11 Nov 2008, at 13:39, Michael Schnell wrote:

a) "ü": "LATIN SMALL LETTER U WITH DIAERESIS", encoded as $C3 $BC
b) "ü": "LATIN SMALL LETTER U", encoded as $75, followed by"COMBINING DIAERESIS", which is encoded as $CC $88
I see, but I fail to see the sense of providing two different UTF8code variants for the same unicode character.

Probably because different kinds of string processing can work moreefficiently with one or the other encoding. Anyway, why it is the caseis moot: the fact is that this is possible (regardless of whether youuse UTF-8, UTF-16 or UTF-32) and therefore you have to deal with itwhen you use unicode.



Jonas_______________________________________________
fpc-devel maillist  -  [email protected]
http://lists.freepascal.org/mailman/listinfo/fpc-devel

Re: [fpc-devel] Unicode support (again)

Reply via email to