Mark, Thank you so much for your help, that worked great! It turns out that I already had the latest version of HTML::Tokeparser::Simple installed.
Thanks again! Brian > -----Original Message----- > From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] > Sent: Thursday, December 23, 2004 1:22 AM > To: Brian Volk > Cc: Beginners (E-mail) > Subject: Re: LWP get only <center> img > > > > > > ----- Original Message ----- > From: Brian Volk <[EMAIL PROTECTED]> > Date: Wednesday, December 22, 2004 12:59 pm > Subject: LWP get only <center> img > > > Hi All, > > Hello > > > > I have a list of url source files... I need to get a certain "<img > > src="from each file. The one thing that separates it from the > > other <img src > > tags is it is preceded by <center> for example: <center><img > > src="/rcp/ObjectServer?table=Images&id=381" but the sequence of > > img tags is > > different in each of the files. Is there a way to get the img > > 'src' tag if > > the img tag is eq to <center>? Maybe I could write a regex to do > > this?pointers? > The module you are trying to use already has everything you > need for the task. > > > > I've broken my script down to try and get the <center> <img scr= > > from just > > one source file. > > > > Below is one attempt where I thought I was getting close ... > > maybe not... > > :~). Any suggestions would be greatly appreciated. > > You are real close, you need to use a few other functions > from the module. > > > > > > > > #!/usr/bin/perl > > > > use strict; > > use warnings; > > use HTML::TokeParser::Simple; > > use LWP::Simple; > > > > my $url = " > > http://www.rcpworksmarter.com/rcp/products/detail.jsp?rcpNum=1013 > > > <http://www.rcpworksmarter.com/rcp/products/detail.jsp?rcpNum=1013> "; > > my $page = get($url) > > or die "Could not load URL\n"; > You can avoid all of that if you download the latest release > of HTML::TokeParser::Simple from CPAN. > > > > > my $parser = HTML::TokeParser::Simple->new(\$page) > > or die "Could not parse page"; > > > > $parser->get_tag ("img") || die; > > my $token = $parser->get_token; > > if ($token->[0] eq "center"); > > print; > > > > # ---end --- > > Here is one way to do it. It's not a compleate deal, but will > work for the test page you have supplied and as a learning base. > > #!/usr/bin/perl > > use strict; > use warnings; > use HTML::TokeParser::Simple; > > > my $url = 'http://www.rcpworksmarter.com/rcp/products/detail.jsp?rcpNum=1013'; my $parser = HTML::TokeParser::Simple->new(url => $url) or die "Could not parse page"; while ( my $token = $parser->get_token ) { if ( $token->is_start_tag( 'center' ) ) { my $TAG = $parser->get_token(); print $TAG->get_attr('src'); } } HTH, Mark G. P.S. How about a free garbige bin for getting you on the way :O) > > > Brian Volk > HP Products > 317.298.9950 x1245 > <[EMAIL PROTECTED]> [EMAIL PROTECTED] > > > -- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] <http://learn.perl.org/> <http://learn.perl.org/first-response> -- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] <http://learn.perl.org/> <http://learn.perl.org/first-response>