Re: Glyphs and graphemes [was Re: Cult-like behaviour]

Marko Rauhamaa Mon, 16 Jul 2018 13:01:31 -0700

Steven D'Aprano <steve+comp.lang.pyt...@pearwood.info>:
> Under that standard definition, UTF-8 and UTF-16 are variable-width,
> and UTF-32 is fixed-width.
>
> But I'll accept that UTF-32 is variable-width if Marko accepts that
> ASCII is too.


If that makes you happy, fine. The point is, UTF-32 has no advantages
over UTF-8. And I'm referring to the text abstraction as seen by the
programmer. It has nothing to do with the layout of bytes inside
CPython.

I use UTF-8 in my C programs and sense no disadvantage. I have never
felt a need for wchar_t. Similarly, I had a small Python2 program that
quizzed me about Hebrew vocabulary with Finnish translations and
Esperanto pronunciation instructions. All UTF-8. No unicode strings. (I
*have* converted that to Python3 just to be on the bleeding edge, but it
didn't give me any advantages over Python2.)


Marko
-- 
https://mail.python.org/mailman/listinfo/python-list

Re: Glyphs and graphemes [was Re: Cult-like behaviour]

Reply via email to