Mark (https://twitter.com/mark_e_davis)
On Thu, Aug 24, 2017 at 11:01 PM, Asmus Freytag via Unicode < unicode@unicode.org> wrote: > On 8/24/2017 10:17 AM, Andre Schappo via Unicode wrote: > >> Because there are many systems that can now handle BMP characters but not >> cannot handle SMP characters. >> >> One example being systems that use mysql utf8 (3 byte encoding) and have >> not yet updated to utf8mb4 (4 byte encoding) >> >> So, I consider it important to familiarise students with SMP characters >> as well as BMP characters. Then when they develop software they will, at >> the start, be thinking beyond ASCII and Unicode BMP characters. >> > > The thinking "beyond BMP" part only comes in when you work in encoding > forms where the BMP uses a different number of code units than the SMP (or > any other non-BMP "page"). This is true for both utf8 and utf16 but not if > you work in utf32 or in scalar values (as in the posted exercise). > > > The trick with using emoji in this lesson is that the descriptions and > images are meaningful to any English speaker, so it gets the student to > learn about character names. > > The same exercise would be more of a challenge for students whose native > tongue is not English. β> The trick with using emoji... True. For emoji names it would be better to use the CLDR names with non-anglophone audiences, since those names are available in a number of languages. eg http://www.unicode.org/cldr/charts/31/annotations/romance.html#π (that was last release's version; next release will have improvements...) β > > > A./ > > >> AndrΓ© Schappo >> >> On 24 Aug 2017, at 17:45, Shriramana Sharma <samj...@gmail.com> wrote: >>> >>> So how do you think it matters if the characters are in the BMP or SMP? >>> >> >> >> >