Re: regular expression unicode character class trouble

2005-09-05 Thread Diez B. Roggisch
Steven Bethard wrote:
> I'd use something like r"[^_\d\W]", that is, all things that are neither
> underscores, digits or non-alphas. In action:
>
> py> re.findall(r'[^_\d\W]+', '42badger100x__xxA1BC')
> ['badger', 'x', 'xxA', 'BC']
>
> HTH,

Seems so, great!

Diez
--
http://mail.python.org/ma
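For readers trying the suggestion today, here is a minimal sketch of the same character class applied to non-ASCII text. It assumes Python 3, where str patterns are Unicode-aware by default; in the Python 2 of this thread's era, a unicode pattern together with the re.UNICODE flag would have been needed for \w, \W and \d to cover non-ASCII characters. The helper name alpha_runs and the second sample string are illustrative only and do not come from the thread.

```python
import re

# [^_\d\W] reads as: "not underscore, not digit, not non-word",
# i.e. word characters minus digits and the underscore.
ALPHA_RUN = re.compile(r"[^_\d\W]+")


def alpha_runs(text):
    """Return every maximal run of characters matched by [^_\\d\\W].

    Hypothetical helper for illustration, not part of the original posts.
    """
    return ALPHA_RUN.findall(text)


# The example from the thread.
print(alpha_runs("42badger100x__xxA1BC"))

# A non-ASCII sample (an assumption for illustration): sharp s and i-diaeresis.
print(alpha_runs("stra\u00dfe42_na\u00efve"))
```

The first call reproduces the result quoted above, ['badger', 'x', 'xxA', 'BC']; the second keeps the two accented words intact while dropping the digits and the underscore between them.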

Re: regular expression unicode character class trouble

2005-09-04 Thread Steven Bethard
Diez B. Roggisch wrote:
> Hi,
>
> I need in a unicode-environment the character-class
>
> set("\w") - set("[0-9]")
>
> or alpha w/o num. Any ideas how to create that?

I'd use something like r"[^_\d\W]", that is, all things that are neither underscores, digits or non-alphas. In action:

py> re