Re: Extracting TD's from a Text File (Regex Help).

Sean Davis Tue, 08 Apr 2008 09:03:00 -0700

On Tue, Apr 8, 2008 at 8:22 AM,  <[EMAIL PROTECTED]> wrote:
> ################ TEXT FILE ##################
>  <td class="PhorumTableRowAlt thread"  style="padding-left: 0px">
>
>             <a href="http://mysite.com/link/here_goes?id=239";>LINK</a>
>
>     &nbsp;<span class="PhorumNewFlag"></span></td>
>
>   <td class="PhorumTableRowAlt" nowrap="nowrap" width="150">
>   <a href="http://mysite.com/link/here_goes?id=239";>LINK</a> </td>
>     <td class="PhorumTableRowAlt PhorumSmallFont" nowrap="nowrap" 
> width="150">06/11/2007 12:29AM
>   </td>
>  </tr>
>  ############################################
>
>  The text file contains hundreds of tds structure like above. All I need is 
> to extract the td with class "PhorumTableRowAlt thread". I have tried every 
> possible option, but finally I am coming to you for any Regex for it? TIA.
>
>  HERE IS WHAT I AM DOING:
>
>  pen(TXT, "links.txt") or die "Unable to open file";
>  my @links = <TXT>;
>  close (TXT);
>  foreach my $link(@links) {
>  if ($link =~ m|<td class="PhorumTableRow thread" style="padding-left: 
> 0px">(.*?)</td>|gsi) {
>  print "$1";}
>  }
>
>
>
>  But NOTHING coming up. No results.


I didn't look closely at your code--sorry.  But when I see questions
related to HTML parsing, I always wonder if it isn't better to use a
module built to deal with HTML.

http://search.cpan.org/dist/HTML-Parser/

is just one example.

Sean

-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/

Re: Extracting TD's from a Text File (Regex Help).

Reply via email to