The script you sent me does get the images; mine doesn't, so I screwed up somewhere along the way. I'll take your original and modify it one step at a time to narrow down what I did wrong.
I'll post back when I get it right so that hopefully someone can learn from my dumbness.

Thanks

Dan

> Funny, here's the script since I modified it perhaps I jacked
> something up
>
> Also I had it checking meta tags ::
>
> my $name = $token->[1]{name} || "-";
> my $http = $token->[1]{http-equiv} || "-";
> my $cont = $token->[1]{content} || "-";
>
> That would grab content for all of them,
> The name if it was a name and if it was an http-equiv it
> would make come out as -
>
> Not sure what I'm doing wrong but any way here's my script :: Thanks
>
> #!/usr/bin/perl
>
> use LWP::Simple;
> use HTML::TokeParser;
>
> $url = $ARGV[0];
>
> $content = get($url);
>
> $p = HTML::TokeParser->new(\$content);
> my %text_links_con;
> my %img_con;
> my %meta_con;
>
> while(1) {
>     my $token = $p->get_tag("a","img","title","meta","rel");
>     last unless($token);
>
>     if($token->[0] eq 'img') {
>         $x++; # I'm doing this so I can get a count of how many it's finding
>         my $src = $token->[1]{src} || "-";
>         my $alt = $token->[1]{alt} || "-";
>         $img_con{$x} = "SRC : $src, ALT :$alt";;
>     }
>     elsif($token->[0] eq 'a') {
>         $i++;
>         my $url = $token->[1]{href} || "-";
>         my $text = $p->get_trimmed_text("/a");
>         $text_links_con{$i} = "Text : $text, URL : $url";
>     }
>     elsif($token->[0] eq 'title') {
>         $title_con = $p->get_trimmed_text;
>     }
>
> }
>
> print "TITLE : $title_con \n";
> foreach $q(keys %text_links_con) { print "LINK : $q - $text_links_con{$q} \n"; }
> foreach $x(keys %img_con) { print "IMG : $x - $img_con{$x} \n"; }
>
> > what do you mean?
> > the following seems to be working:
> >
> > #!/usr/bin/perl -w
> > use strict;
> >
> > use HTML::TokeParser;
> >
> > my $tok = new HTML::TokeParser(*DATA) || die $!;
> > while(1){
> >     my $token = $tok->get_tag("a","img");
> >     last unless($token);
> >     if($token->[0] eq 'a'){
> >         print $token->[1]{href} || "what?","\n";
> >     }else{
> >         print $token->[1]{src} || "again?","\n";
> >     }
> > }
> >
> > __DATA__
> > <html>
> > <body>
> > <a href=link1>link1</a>
> > <img src=img1>img1</img>
> > <a href=link2><img src=img_inside_a></img></a>
> > </body>
> > </html>
> >
> > prints:
> >
> > link1
> > img1
> > link2
> > img_inside_a
> >
> > so img_inside_a does show up. am i missing something?
> >
> > david

--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]