New submission from Senthil Kumaran <sent...@uthcode.com>:

Opening this bug following this discussion - http://mail.python.org/pipermail/docs/2012-March/007829.html

library/re.html

\S
    When the LOCALE and UNICODE flags are not specified, matches any
    non-whitespace character; this is equivalent to the set [^ \t\n\r\f\v].
    With LOCALE, it will match any character not in this set, and not
    defined as space in the current locale. If UNICODE is set, this will
    match anything other than [ \t\n\r\f\v] and characters marked as space
    in the Unicode character properties database.

This is wrong. With LOCALE set, it should be [^ \t\n\r\f\v] plus any non-space character in that locale.

----------
assignee: orsenthil
components: Documentation
messages: 155434
nosy: orsenthil
priority: low
severity: normal
status: open
title: Better explain re.LOCALE and re.UNICODE for \S and \W
type: behavior
versions: Python 2.7

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue14258>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
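
For reference, the difference between the plain and UNICODE readings of \S quoted in the report can be checked with a short sketch. It is written for Python 3 rather than the 2.7 this issue targets (an assumption, so that it runs on a current interpreter): re.ASCII stands in for the "no flags" behavior of 2.7, and re.UNICODE is already the default for str patterns. The helper name non_space_chars is hypothetical. LOCALE is left out because its result depends on the locale active at run time.

```python
import re


def non_space_chars(text, flags=0):
    """Return the characters of `text` that \\S matches under `flags`.

    Hypothetical helper, used only for this illustration.
    """
    return re.findall(r"\S", text, flags)


# U+00A0 (NO-BREAK SPACE) is whitespace in the Unicode database,
# but it is not in the ASCII set [ \t\n\r\f\v].
sample = "a b\tc\u00a0d"

# ASCII-only matching: \S is [^ \t\n\r\f\v], so U+00A0 is matched.
ascii_result = non_space_chars(sample, re.ASCII)

# Unicode matching: U+00A0 counts as whitespace, so \S skips it.
unicode_result = non_space_chars(sample, re.UNICODE)

print(ascii_result)
print(unicode_result)
```

Under these assumptions, the only character on which the two results disagree is U+00A0, which is the kind of case the documentation wording for \S needs to describe precisely.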