[issue27878] Unicode word boundries

2016-08-27 Thread SilentGhost
SilentGhost added the comment: regex module is not in standard library, on the latest 3.6 branch re module breaks on curly apostrophe just fine. Perhaps, try reporting this issue on the bitbucket tracker? -- nosy: +SilentGhost resolution: -> not a bug stage: -> resolved status: open

[issue27878] Unicode word boundries

2016-08-27 Thread mohammad aghanabi
New submission from mohammad aghanabi: According to [UAX #29](http://unicode.org/reports/tr29) - unicode word boundaries (rule WB5a), an apostrophe includes U+0027 ( ' ) APOSTROPHE and U+2019 ( ’ ) RIGHT SINGLE QUOTATION MARK (curly apostrophe). However regex module only implements U+0027 and