[issue7255] "Default" word boundaries for Unicode data?

2010-08-15 Thread daniel mccloy
daniel mccloy added the comment: Woo-HOOO! Am very excited to hear this! Thanks, Matthew! This, along with the related \w \W handling (#1693050), should be extremely useful for processing Indic text. I'm a Python newbie, so will need to find some help on what I need to do to compile/in

[issue7255] "Default" word boundaries for Unicode data?

2009-11-02 Thread daniel mccloy
New submission from daniel mccloy: Regarding UTS #18 (Unicode Standards for RegEx Engines), which can be found at: http://www.unicode.org/reports/tr18/ Is there a plan or commitment for Python to implement at least "default word boundaries" (a Level 2 feature), rather than the curre
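
For readers following the thread, here is a minimal sketch of why this matters for Indic text. It assumes the standard library `re` and `unicodedata` modules; the helper names `word_runs` and `mark_positions` are hypothetical and not taken from the issue. The sample is the Hindi word for "Hindi", which contains combining marks (vowel signs and a virama) between its base letters. A matcher that treats only letters and digits as word characters may break the word at those marks, which is the kind of behaviour the "default word boundaries" request is about.

```python
import re
import unicodedata


def word_runs(text):
    """Return the runs that the stdlib re module matches with \\w+."""
    return re.findall(r"\w+", text)


def mark_positions(text):
    """List (index, name, category) for every combining mark (category M*)."""
    return [
        (i, unicodedata.name(ch, "?"), unicodedata.category(ch))
        for i, ch in enumerate(text)
        if unicodedata.category(ch).startswith("M")
    ]


# Hindi word "Hindi": HA, VOWEL SIGN I, NA, VIRAMA, DA, VOWEL SIGN II
sample = "\u0939\u093f\u0928\u094d\u0926\u0940"

# If more than one run is printed, \w+ split a single word at its marks.
print(word_runs(sample))

# The combining marks that a letters-only notion of "word" would not cover.
for index, name, category in mark_positions(sample):
    print(index, name, category)
```

Whether the first line prints one run or several depends on the regex engine and version in use, so treat this as a diagnostic rather than a statement of what any given release does.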