Re: Python Unicode handling wins again -- mostly

Ned Batchelder Mon, 02 Dec 2013 13:16:23 -0800

On 12/2/13 3:38 PM, Ethan Furman wrote:

On 11/29/2013 04:44 PM, Steven D'Aprano wrote:


Out of the nine tests, Python 3.3 passes six, with three tests being
failures or dubious. If you believe that the native string type should
operate on code-points, then you'll think that Python does the right
thing.


I think Python is doing it correctly.  If I want to operate on
"clusters" I'll normalize the string first.

Thanks for this excellent post.

--
~Ethan~

This is where my knowledge about Unicode gets fuzzy. Isn't it the casethat some grapheme clusters (or whatever the right word is) can't benormalized down to a single code point? Characters can accept manyaccents, for example. In that case, you can't always normalize and usethe existing string methods, but would need more specialized code.


--Ned.

--
https://mail.python.org/mailman/listinfo/python-list

Re: Python Unicode handling wins again -- mostly

Reply via email to