[issue24863] Incoherent bevavior with umlaut in regular expressions

2015-08-14 Thread Zachary Ware
Changes by Zachary Ware: -- stage: -> resolved

[issue24863] Incoherent bevavior with umlaut in regular expressions

2015-08-14 Thread R. David Murray
R. David Murray added the comment: Don't be embarrassed; a report like this turns up on this tracker about every three or four months. Unfortunately there's nothing we can do to make the situation better because of backward compatibility concerns. -- nosy: +r.david.murray

[issue24863] Incoherent bevavior with umlaut in regular expressions

2015-08-14 Thread Christian Klein
Christian Klein added the comment: Wow, that's very embarrassing. Thank you. (I tried to get further help before, but nobody recognized that stupid mistake.)

[issue24863] Incoherent bevavior with umlaut in regular expressions

2015-08-14 Thread eryksun
eryksun added the comment: You're passing re.UNICODE (32) as the value of the count parameter, i.e. the function signature is re.sub(pattern, repl, string, count=0, flags=0). -- nosy: +eryksun resolution: -> not a bug status: open -> closed
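The mix-up described in this comment can be reproduced with a small sketch. It is written for Python 3, not the Python 2.7 of the report, and it uses re.IGNORECASE as a stand-in flag, because Python 3 str patterns are already Unicode-aware and re.UNICODE would show no visible difference there. The sample string and variable names are illustrative only.

```python
import re

text = "AaAaaa"  # illustrative input, not taken from the report

# Flags are small integers: re.IGNORECASE has the value 2.
# Putting a flag in the fourth positional slot of re.sub() therefore
# sets count, not flags. Written out explicitly, that call amounts to:
as_count = re.sub("a", "*", text, count=int(re.IGNORECASE))

# The intended call passes the flag by keyword.
as_flags = re.sub("a", "*", text, flags=re.IGNORECASE)

print(as_count)  # only the first two lowercase 'a' are replaced
print(as_flags)  # every 'a' and 'A' is replaced
```

Passing flags by keyword avoids the problem entirely. re.findall(pattern, string, flags=0) has no count parameter, which is why only the re.sub call in the report misbehaved.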

[issue24863] Incoherent bevavior with umlaut in regular expressions

2015-08-14 Thread Christian Klein
New submission from Christian Klein: The Python 2.7 re module seems not to agree on what to consider a word character:
    import re
    s = u'f\xfc'
    print re.sub('\W', '*', s, re.UNICODE)
    print re.findall('\w', s, re.UNICODE)
The application of re.sub removes the character u'ü' which implies it's consid