LGB> | Sorry, I don't understand. The length of the string U+0065 U+0301
LGB> | certainly is 2, regardless of how the rendering engine displays this.
LGB> | Of course, the rendering engine should render it as "é" because U+0301
LGB> | is a combining character, but the string length is still 2.

LGB> Not if I want to count the number of characters in the document.

That's true. The problem is that you probably need the "real" string length
information anyway for string operations. So if there's a need for a
figure for number of characters in the document excluding combining
characters, but including some combining characters (like Arabic or
Hebrew vowels), one will need a secondary function.

Cheers -
  Philipp Reichmuth                            mailto:[EMAIL PROTECTED]

--
With searching comes loss / and the presence of absence / The data, not found

Reply via email to