Take it out of the loop and add /s if it could span lines. On Mon, Nov 25, 2013 at 2:44 PM, Mike Blezien <mick...@frontiernet.net> wrote: > Thank you for your and the other list members suggestions, gives me a good > place to start. > > > Mike(mickalo)Blezien > =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-= > Thunder Rain Internet Publishing > -=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=- > > ----- Original Message ----- > From: Jim Gibson > To: Perl List > Sent: Monday, November 25, 2013 1:23 PM > Subject: Re: Regrex Question > > > On Nov 25, 2013, at 10:55 AM, Mike Blezien wrote: > >> Hello, >> >> Regular expression have never been my strong suite so hoping to get a >> litte help with a line in file I need to extract a portion of it. >> >> The text I need to extract from this line is "November 21, 2013" from this >> line in the file, just the date: >> >> Posted by <a href="mailto:someem...@email.com">Some Name</a> on November >> 21, 2013 at 23:21:58:<p> >> >> what would be the regrex to use to extract the date from the line ? > > The usual advice applies: don't use regular expressions to parse HTML. > However, lots of people do it anyway, myself included. Your success at > extracting usable data depends upon how rigid the format of the HTML is from > page to page. > > In your case, if the date always follows a link ('</a>') followed by 'on', > and the date is always followed by 'at' and a time. you can use this: > > if( $line =~ m{ </a> \s+ on \s+ (\w+ \s \d{1,2} , \s \d{4}) \s at}x ) { > print "The date is $1\n"; > }else{ > print "No match\n"; > } > > Note I am using the extended regular expression syntax with the x modifier. > > > -- > To unsubscribe, e-mail: beginners-unsubscr...@perl.org > For additional commands, e-mail: beginners-h...@perl.org > http://learn.perl.org/ > >
-- To unsubscribe, e-mail: beginners-unsubscr...@perl.org For additional commands, e-mail: beginners-h...@perl.org http://learn.perl.org/