Re: unicode direction control characters

2018-01-02 Thread Random832
On Tue, Jan 2, 2018, at 10:36, Robin Becker wrote: > >> u'\u200e28\u200e/\u200e09\u200e/\u200e1962' > > I guess I'm really wondering whether the BIDI control characters have any > semantic meaning. Most numbers seem to be LTR. > > If I saw u'\u200f12' it seems to imply that the characters should

Re: unicode direction control characters

2018-01-02 Thread Chris Angelico
On Wed, Jan 3, 2018 at 2:36 AM, Robin Becker wrote: > On 02/01/2018 15:18, Chris Angelico wrote: >> >> On Wed, Jan 3, 2018 at 1:30 AM, Robin Becker wrote: >>> >>> I'm seeing some strange characters in web responses eg >>> >>> u'\u200e28\u200e/\u200e09\u200e/\u200e1962' >>> >>> for a date of birth

Re: unicode direction control characters

2018-01-02 Thread Robin Becker
On 02/01/2018 15:18, Chris Angelico wrote: On Wed, Jan 3, 2018 at 1:30 AM, Robin Becker wrote: I'm seeing some strange characters in web responses eg u'\u200e28\u200e/\u200e09\u200e/\u200e1962' for a date of birth. The code \u200e is LEFT-TO-RIGHT MARK according to unicodedata.name. I tried

Re: unicode direction control characters

2018-01-02 Thread Chris Angelico
On Wed, Jan 3, 2018 at 1:30 AM, Robin Becker wrote: > I'm seeing some strange characters in web responses eg > > u'\u200e28\u200e/\u200e09\u200e/\u200e1962' > > for a date of birth. The code \u200e is LEFT-TO-RIGHT MARK according to > unicodedata.name. I tried unicodedata.normalize, but it leaves