Re: regular expression i'm going crazy

Robert Kern Mon, 16 May 2011 09:55:16 -0700

On 5/16/11 11:25 AM, Tracubik wrote:

pls help me fixing this:


import re
s = "linka la baba"
re_s = re.compile(r'(link|l)a' , re.IGNORECASE)

print re_s.findall(s)

output:
['link', 'l']

why?
i want my re_s to find linka and la, he just find link and l and forget
about the ending a.

can anyone help me? trying the regular expression in redemo.py (program
provided with python to explore the use of regular expression) i get what
i want, so i guess re_s is ok, but it still fail...
why?

The parentheses () create a capturing group, which specifies that the contentsof the group should be extracted. See the "(...)" entry here:


  http://docs.python.org/library/re#regular-expression-syntax

You can use the non-capturing version of parentheses if you want to just isolatethe | from affecting the rest of the regex:

"""

(?:...) A non-capturing version of regular parentheses. Matches whateverregular expression is inside the parentheses, but the substring matched by thegroup cannot be retrieved after performing a match or referenced later in thepattern.

"""

[~]
|1> import re

[~]
|2> s = "linka la baba"

[~]
|3> re_s = re.compile(r'(?:link|l)a' , re.IGNORECASE)

[~]
|4> print re_s.findall(s)
['linka', 'la']


--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma
 that is made terrible by our own mad attempt to interpret it as though it had
 an underlying truth."
  -- Umberto Eco

--
http://mail.python.org/mailman/listinfo/python-list

Re: regular expression i'm going crazy

Reply via email to