Re: Regular expression, "not this string"

Dave Cardwell Mon, 12 Mar 2007 09:54:17 -0800

Rob Dixon wrote:

Dave Cardwell wrote:
Hello there, I'm having trouble constructing a regular expression thatwould do the following:
FOO...
...followed by anything but BAR (non-greedy)...
...followed by BAZ (captured)...
...followed by anything but BAR (greedy)...
...followed by BAR
I've been looking at zero-width negative look-ahead, but I haven'tused this area of regular expressions before so I'm struggling. Asolution or prod in the right direction would be lovely.
Please show us the real problem. I know you mean to clarify, but your
summary is so ambiguous that understanding it becomes the most difficult
part of providing a solution.

Thanks,

Rob

I was afraid of that, sorry. I'm using HTML::Parser to scan through adocument, but I need to do one quick manipulation first that depends onseeing the document as a whole (unlike per-token as with HTML::Parser).Rather than attempting to fit all of the real work in a regularexpression, I thought it best to simply mark the element with a customattribute that HTML::Parser could pick up later.

To that end, I need to find an <a> (BAZ) that contains just plain text,somewhere between an opening <td> (FOO) and the closest closing </td>(BAR), ie something along the lines of:


s%
    <td([^>]*>
        {not </td>}*?
            <a[^>]*>[\w\s]+</a>
        {not </td>}*?
    </td>)
%<td foo="1"$1%gismx;

It's the {not </td>} bits I'm having difficulty with.


--
Best wishes,
Dave Cardwell.

http://perlprogrammer.co.uk/


--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/

Re: Regular expression, "not this string"

Reply via email to