On Mon, Dec 16, 2002 at 01:07:36PM +0000, mcharity @ vendian. org wrote: This question is actually independent of the patch (which looks good)
> simply returns the C<INTVAL> it is passed; C<string_utf8_max_bytes>, on the > other hand, returns three times the value that it is passed because a > UTF8 character may occupy up to three bytes. Should that really be the number 3? I thought that the UTF8 representation of code points outside the base Unicode plane could get longer than that. Nicholas Clark