New submission from Senthil Kumaran <sent...@uthcode.com>:

Opening the this bug following this discussion - 
http://mail.python.org/pipermail/docs/2012-March/007829.html

library/re.html

\S

When the LOCALE and UNICODE flags are not specified, matches any non-whitespace 
character; this is equivalent to the set [^ \t\n\r\f\v] With LOCALE, it will 
match any character not in this set, and not defined as space in the current 
locale. If UNICODE is set, this will match anything other than [ \t\n\r\f\v] 
and characters marked as space in the Unicode character properties database.

This is wrong. With LOCALE set, it should be [^ \t\n\r\f\v] plus any non-space 
character in that locale.

----------
assignee: orsenthil
components: Documentation
messages: 155434
nosy: orsenthil
priority: low
severity: normal
status: open
title: Better explain re.LOCALE and re.UNICODE for \S and \W
type: behavior
versions: Python 2.7

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue14258>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

Reply via email to