[issue30838] re \w does not match some valid Unicode characters

David Lord Wed, 05 Jul 2017 08:20:01 -0700

David Lord added the comment:

After thinking about it more, I guess I misunderstood what \w was doing 
compared to isidentifier. Since Python just relies on the Unicode database, 
there's not much to be done anyway. Closing this.


For anyone interested, we ended up with a hybrid approach for lexing 
identifiers: build a regex group that includes all valid ranges not matched by 
\w, then validate with isidentifier later. 
https://github.com/pallets/jinja/pull/731/files

----------
resolution:  -> not a bug
stage:  -> resolved
status: open -> closed

_______________________________________
Python tracker <[email protected]>
<http://bugs.python.org/issue30838>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

[issue30838] re \w does not match some valid Unicode characters

Reply via email to