I may be misremembering what I've read here but I thought that Dan said that for variable length encodings (such as shift-JIS) parrot would store the byte(s) in memory in constant size 16 or 32 bit integers, rather than the (external) variable length byte sequence, as this gives O(1) random access, and avoids much coding pain.
However, he made no explicit comment about UTF8 (just another variable length encoding), which would imply that parrot will be storing UTF8 in this way.
Yup. UTF8 is Just another variable-width encoding. Do anything with it and we convert it to a fixed-width encoding, in this case UTF32.
--
Dan
--------------------------------------"it's like this"------------------- Dan Sugalski even samurai [EMAIL PROTECTED] have teddy bears and even teddy bears get drunk