Re: Python regular expressions just ain't PCRE

2007-05-08 Thread John Machin
On May 9, 7:34 am, Klaas <[EMAIL PROTECTED]> wrote: > On May 5, 6:57 pm, Wiseman <[EMAIL PROTECTED]> wrote: > > > > There's also the YAGNI factor; most folk would restrict using regular > > > expressions to simple grep-like functionality and data validation -- > > > e.g. re.match("[A-Z][A-Z]?[0-9]{

Re: Python regular expressions just ain't PCRE

2007-05-08 Thread Klaas
On May 5, 6:57 pm, Wiseman <[EMAIL PROTECTED]> wrote: > > There's also the YAGNI factor; most folk would restrict using regular > > expressions to simple grep-like functionality and data validation -- > > e.g. re.match("[A-Z][A-Z]?[0-9]{6}[0-9A]$", idno). The few who want to > > recognise yet anot

Re: Python regular expressions just ain't PCRE

2007-05-05 Thread Wiseman
On May 5, 10:44 pm, John Machin <[EMAIL PROTECTED]> wrote: > "UTF-8 Unicode" is meaningless. Python has internal unicode string > objects, with comprehensive support for converting to/from str (8-bit) > string objects. The re module supports unicode patterns and strings. > PCRE "supports" patterns

Re: Python regular expressions just ain't PCRE

2007-05-05 Thread Wiseman
On May 5, 10:06 pm, "[EMAIL PROTECTED]" <[EMAIL PROTECTED]> wrote: > -1 on this from me. In the past 10 years as a professional > programmer, I've used the wierd extended "regex" features maybe 5 > times total, whether it be in Perl or Python. In contrast, I've had > to work around the slowness o

Re: Python regular expressions just ain't PCRE

2007-05-05 Thread Wiseman
On May 5, 6:28 pm, [EMAIL PROTECTED] wrote: > I'm not sure what your skill level is, but I would suggest studying the > code, starting in on a patch for one or more of these features, and then > corresponding with the module's maintainers to improve your patch to the > point where it can be accept

Re: Python regular expressions just ain't PCRE

2007-05-05 Thread John Machin
On May 6, 1:52 am, Wiseman <[EMAIL PROTECTED]> wrote: > On May 5, 5:12 am, "Terry Reedy" <[EMAIL PROTECTED]> wrote: > > > I believe the current Python re module was written to replace the Python > > wrapping of pcre in order to support unicode. > > I don't know how PCRE was back then, but right now

Re: Python regular expressions just ain't PCRE

2007-05-05 Thread [EMAIL PROTECTED]
Wiseman wrote: > I'm kind of disappointed with the re regular expressions module. In > particular, the lack of support for recursion ( (?R) or (?n) ) is a > major drawback to me. There are so many great things that can be > accomplished with regular expressions this way, such as validating a > math

Re: Python regular expressions just ain't PCRE

2007-05-05 Thread Martin v. Löwis
> Are there any plans to support these features in re? This question is impossible to answer. I don't have such plans, and I don't know of any, but how could I speak for the hundreds of contributors to Python world-wide, including those future contributors which haven't contributed *yet*. Do you

Re: Python regular expressions just ain't PCRE

2007-05-05 Thread dustin
On Sat, May 05, 2007 at 08:52:15AM -0700, Wiseman wrote: > > I believe the current Python re module was written to replace the Python > > wrapping of pcre in order to support unicode. > > I don't know how PCRE was back then, but right now it supports UTF-8 > Unicode patterns and strings, and Unico

Re: Python regular expressions just ain't PCRE

2007-05-05 Thread Wiseman
On May 5, 7:19 am, Marc 'BlackJack' Rintsch <[EMAIL PROTECTED]> wrote: > In <[EMAIL PROTECTED]>, Wiseman wrote: > > Note: I know there are LALR parser generators/parsers for Python, but > > the very reason why re exists is to provide a much simpler, more > > productive way to parse or validate simp

Re: Python regular expressions just ain't PCRE

2007-05-05 Thread Wiseman
On May 5, 5:12 am, "Terry Reedy" <[EMAIL PROTECTED]> wrote: > I believe the current Python re module was written to replace the Python > wrapping of pcre in order to support unicode. I don't know how PCRE was back then, but right now it supports UTF-8 Unicode patterns and strings, and Unicode cha

Re: Python regular expressions just ain't PCRE

2007-05-04 Thread Marc 'BlackJack' Rintsch
In <[EMAIL PROTECTED]>, Wiseman wrote: > Note: I know there are LALR parser generators/parsers for Python, but > the very reason why re exists is to provide a much simpler, more > productive way to parse or validate simple languages and process text. > (The pyparse/yappy/yapps/ generator here> arg

Re: Python regular expressions just ain't PCRE

2007-05-04 Thread Terry Reedy
"Wiseman" <[EMAIL PROTECTED]> wrote in message news:[EMAIL PROTECTED] | I'm kind of disappointed with the re regular expressions module. I believe the current Python re module was written to replace the Python wrapping of pcre in order to support unicode. | In particular, the lack of support f

Python regular expressions just ain't PCRE

2007-05-04 Thread Wiseman
I'm kind of disappointed with the re regular expressions module. In particular, the lack of support for recursion ( (?R) or (?n) ) is a major drawback to me. There are so many great things that can be accomplished with regular expressions this way, such as validating a mathematical expression or pa