Re: Help understanding why the RE does not totally work

John W. Krahn Fri, 26 Sep 2008 12:12:36 -0700

Jack Gates wrote:

On Friday 26 September 2008 12:48:14 pm Jack Gates wrote:
s!(<|</)([^\!][A-Z0-9 ]+>)!$1\L$2\E!g;
or
s/(<|<\/)([^!][A-Z0-9 ]+>)/$1\L$2\E/g;
The RE above captures and replaces all HTML tags with lowercase
as desired except for any tag that has only one letter such as
, or 

It will get the , and 

It properly ignores the <!DOCTYPE> tag

What is the correct way to write the above RE?
John helped me achieve what I wanted. His RE sample got me to theright place after tweaking it a little.
I would like to understand why what I originally had did not work,so I can learn better. Will some one show me why my original RE didnot work? Meaning it did not get the tags as explained abovepreviously.

Match '<' or '</' followed by one character that is not '!' followed byone or more of 'A-Z0-9 ' followed by '>'.

Your pattern will not match '' because that contains three charactersbut your pattern has to match at least four characters.




John
--
Perl isn't a toolbox, but a small machine shop where you
can special-order certain sorts of tools at low cost and
in short order.                            -- Larry Wall

--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/

Re: Help understanding why the RE does not totally work

Reply via email to