Re: Pyparsing: Non-greedy matching?

Peter Fein Thu, 30 Dec 2004 19:24:33 -0800

On 12/31/04 03:00 AM CST, "Paul McGuire" <[EMAIL PROTECTED]>
sayeth:
> Assuming that your <td> tag wont contain any nested <td> tag, you
> could define your data content as "everything up until I find
> '</td>'".  For this you can use pyparsing's SkipTo element.  I think
> if you define data as:
>     data = SkipTo("</td>")
> then your code should start working better.


Hey!  It does! ;) I just worked up (note to googlers- don't do this):
goodchars=printables.replace("<", "")
good_ab="<" + (~Literal(r"""/td>""") + SkipTo(">", include=True))
good_ab.setDebug(True)
simple=Word(goodchars)
complex=Combine(simple | good_ab, adjacent=False, joinString="")
data=Combine(OneOrMore(complex), adjacent=False, joinString=" ")
title4=title_open+data

But that would break for closers with more than one ">".  Need to stop
thinking like these are regexps.  Thanks - this is a great tool. ;)

-- 
Peter Fein                 [EMAIL PROTECTED]                 773-575-0694

Basically, if you're not a utopianist, you're a schmuck. -J. Feldman
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: Pyparsing: Non-greedy matching?

Reply via email to