A Unicode string in UTF-8 encoding form could be ill-formed if the bytes don't 
follow the specification for UTF-8, for example.
Given that answer, add "in UTF-32" to my email just now, for simplicity's sake. Or let's simply assume we're dealing with some sort of sequence of abstract integers from hex+0 to hex+10FFFF, to abstract away from "encoding form" issues.

Stephan


Reply via email to