Lars Gullik Bjønnes wrote:

Thanks for supplying the bigger picture. I've only one point to make:
> (Even UCS-4 is not "one-codepoint" "one-glyph", combining chars are
> required for proper display)

Sure. But that's not information needed by the CORE, is it? The core acts on (strings of) single code points. All paragraph breaking etc. acts on single code points. In other words, if ICU can iterate over single code points in the Unicode string, then the core algorithms won't need to change at all. Right?

-- 
Angus
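
To make the point above concrete, here is a rough sketch of what "acting on single code points" could look like. It is not LyX code and does not use ICU: `breakLines` is a hypothetical helper, and `std::u32string` stands in for whatever UCS-4 string type the core would actually use. The only assumption is that the string can be walked one code point at a time.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper, not taken from LyX: greedy line breaking over a
// UCS-4 string. Widths are counted in code points, not glyphs.
// A word longer than `width` is placed on a line of its own, unsplit.
std::vector<std::u32string> breakLines(const std::u32string& text,
                                       std::size_t width)
{
    assert(width > 0);

    std::vector<std::u32string> lines;
    std::u32string line;  // line being built
    std::u32string word;  // word being collected

    // Append the pending word to the current line, starting a new
    // line first if the word would not fit.
    auto flushWord = [&]() {
        if (word.empty())
            return;
        if (!line.empty() && line.size() + 1 + word.size() > width) {
            lines.push_back(line);
            line.clear();
        }
        if (!line.empty())
            line.push_back(U' ');
        line += word;
        word.clear();
    };

    // One iteration per code point. A combining mark is just another
    // code point here; the loop never needs to know about glyphs.
    for (char32_t cp : text) {
        if (cp == U' ')
            flushWord();
        else
            word.push_back(cp);
    }
    flushWord();

    if (!line.empty())
        lines.push_back(line);
    return lines;
}
```

With the input `U"cafe\u0301 au lait"` and a width of 8, the first word is five code points ('e' followed by U+0301 COMBINING ACUTE ACCENT) even though it displays as four glyphs, and the breaker treats it like any other five-element word. Whether counting code points is good enough for layout is a separate question; the sketch only shows that the loop itself depends on nothing beyond code point iteration.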