On Thu, Jan 16, 2014 at 1:55 AM, <wxjmfa...@gmail.com> wrote: > Le mercredi 15 janvier 2014 13:13:36 UTC+1, Ned Batchelder a écrit : > >> >> ... more than one codepoint makes up a grapheme ... > > No
Yes. http://www.unicode.org/faq/char_combmark.html >> In Unicode terms, an encoding is a mapping between codepoints and bytes. > > No Yes. http://www.unicode.org/reports/tr17/ Specifically: "Character Encoding Form: a mapping from a set of nonnegative integers that are elements of a CCS to a set of sequences of particular code units of some specified width, such as 32-bit integers" Or are you saying that www.unicode.org is wrong about the definitions of Unicode terms? ChrisA -- https://mail.python.org/mailman/listinfo/python-list