Gilles Ganault wrote:
Hello
Some of the addresses are missing a space between the street name and
the ZIP code, e.g. "123 Main Street01159 Someville"
The following regex doesn't seem to work:
#Check for any non-space before a five-digit number
re_bad_address = re.compile('([^\s].)(\d{5}) ',re.I | re.S | re.M)
-----------------------------------^
I also tried ([^ ].), to no avail.
------------------^
What is the right way to tell the Python re module to check for any
non-space character?
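It looks like the periods are what's throwing you off: each "." matches
one extra character of any kind, so the first group captures two
characters instead of one. Just remove them. For a third syntax: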
(\S)(\d{5})
The \S (capital, instead of "\s") means "any NON-whitespace character".
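As a rough sketch of how that pattern could be used to repair the addresses (the fix_address helper and the replacement string are my own assumptions for illustration, not something from your code), something along these lines should do it:

```python
import re

# Pattern from above: one non-whitespace character immediately
# followed by a five-digit number. A raw string keeps the
# backslashes intact.
re_bad_address = re.compile(r'(\S)(\d{5})')

def fix_address(address):
    # Hypothetical helper: re-insert the missing space between the
    # captured character and the ZIP code.
    return re_bad_address.sub(r'\1 \2', address)

print(fix_address("123 Main Street01159 Someville"))
print(fix_address("123 Main Street 01159 Someville"))
```

Note that this would also split any longer run of six or more digits, so you may want to tighten the pattern if your data contains those.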
-tkc