Re: HTML::TreeBuilder - handle invalid html gracefully

2009-08-23 Thread Roman Makurin
On Sun, Aug 23, 2009 at 02:56:44PM +0400, Roman Makurin wrote:
> Hi All!
>
> How can I tell HTML::TreeBuilder to parse invalid html files
> gracefully ? Here is an example:
>
> -
> #!/usr/bin/perl
>
> use strict;
> use warnings;
>
> use HTML::TreeBuilder;
>
> my $root = HTML::TreeBuilder->

Re: HTML::TreeBuilder encode symbols as html entities

2009-08-14 Thread Roman Makurin
On Fri, Aug 14, 2009 at 5:35 PM, Shawn H. Corey wrote:
> Roman Makurin wrote:
>>
>> dump result is html encoded entities:
>>
>> @0.1.5.1
>>  > title="Ссылка ">@0.1.5.1.0
>>
>> all html entities are valid unicode code points of symbols. But why
>> HTML::TreeBuilder convert symbols to entities ?
>
>

Re: HTML::TreeBuilder encode symbols as html entities

2009-08-14 Thread Shawn H. Corey
Roman Makurin wrote:
> dump result is html encoded entities:
>
> @0.1.5.1 @0.1.5.1.0
>
> all html entities are valid unicode code points of symbols. But why
> HTML::TreeBuilder convert symbols to entities ?

Because some browsers do not understand Unicode. Or they didn't.

If I just do print $conten
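The entity question above concerns `dump`, but the same encoding shows up when a tree is serialized with `as_HTML`. A minimal sketch, assuming the sample markup below (it is invented for illustration and is not the page from the thread): `as_HTML` takes an optional first argument naming the characters to encode, so passing `'<>&'` should leave other characters, Cyrillic included, as they are.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use utf8;                                   # the source below contains literal Cyrillic

use HTML::TreeBuilder;

binmode STDOUT, ':encoding(UTF-8)';

# Hypothetical sample markup, not taken from the thread.
my $tree = HTML::TreeBuilder->new_from_content(
    '<html><body><p title="Ссылка">Привет</p></body></html>'
);

my ($p) = $tree->look_down(_tag => 'p');

# Default: every character HTML::Entities considers unsafe is encoded.
my $default = $p->as_HTML;

# Encode only the characters that are special in HTML.
my $minimal = $p->as_HTML('<>&');

print "default: $default\n";
print "minimal: $minimal\n";
```

Whether this helps depends on what the output is used for; if only `dump` is involved, the entities are a display detail and the tree itself still holds the decoded characters.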

Re: HTML::TreeBuilder help

2008-07-17 Thread Rob Dixon
Ryan wrote:
>
> The Dump method gives me this:
> cellpadding="0" cellspacing="0" width="100%"> @0.1.1.0.1.0.0.0.0.1
> @0.1.1.0.1.0.0.0.0.1.0
> @0.1.1.0.1.0.0.0.0.1.0.0
>
> How can I make use of "@0.1.1.0.1.0.0.0.0.1" if I know that's t
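The `@0.1...` strings printed by `dump` are element addresses, and HTML::Element provides an `address` method that works in both directions: called without arguments it returns an element's address, and called on the tree with an address string it returns the element at that position. A minimal sketch, assuming a small invented table in place of the page from the thread:

```perl
#!/usr/bin/perl
use strict;
use warnings;

use HTML::TreeBuilder;

# Hypothetical sample markup, not the page from the thread.
my $html = '<html><head><title>t</title></head>'
         . '<body><table><tr><td>first</td><td>second</td></tr></table></body></html>';

my $tree = HTML::TreeBuilder->new_from_content($html);

# Locate a cell by its text, then ask the element for its address.
my ($cell) = $tree->look_down(_tag => 'td', sub { $_[0]->as_text eq 'second' });
my $addr = $cell->address;            # the same string dump() prints after "@"

# Go the other way: fetch the element that lives at that address.
my $same = $tree->address($addr);
my $text = $same->as_text;

print "address $addr holds: $text\n";

$tree->delete;                        # free the tree (needed on older versions)
```

Addresses depend on the exact shape of the parsed tree, so one copied from a `dump` is only valid while the page structure stays the same; searching with `look_down` tends to be more robust.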

Re: HTML::TreeBuilder - finding a text element

2007-03-25 Thread Rob Dixon
Brandino Andreas wrote:
> Hi list
>
> I am using HTML::TreeBuilder to parse a html page and find a specific
> value. When i dump the array i get this:
>
> $tree->dump();
> more..
> @0.1.0.1.1.0.0.0.0
> @0.1.0.1.1.0.0.0.0.0
> "MAC Address"
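One way to find a value by its label rather than by its address is `look_down` with a code reference. A minimal sketch, assuming (this is not stated in the thread) that the label and the value sit in adjacent table cells; the markup and the MAC value are invented for illustration:

```perl
#!/usr/bin/perl
use strict;
use warnings;

use HTML::TreeBuilder;

# Hypothetical sample markup; the real page layout is not shown in the thread.
my $html = '<html><body><table><tr>'
         . '<td>MAC Address</td><td>00:11:22:33:44:55</td>'
         . '</tr></table></body></html>';

my $tree = HTML::TreeBuilder->new_from_content($html);

# In scalar context look_down returns the first element matching all criteria.
my $label = $tree->look_down(
    _tag => 'td',
    sub { $_[0]->as_text =~ /^\s*MAC Address\s*$/ },
);

# right() in scalar context returns the next sibling, which may be a plain
# text string instead of an element, hence the ref() check.
my $neighbour = $label ? $label->right : undef;
my $value     = ref $neighbour ? $neighbour->as_text : undef;

print defined $value ? "MAC Address: $value\n" : "label not found\n";

$tree->delete;
```

If the value lives somewhere else relative to the label (a following row, a nested element), the navigation step after `look_down` would need to change accordingly.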

Re: HTML::TreeBuilder

2002-02-06 Thread Peter Scott
At 12:14 PM 2/7/02 +1030, Daniel Falkenberg wrote:
>I am currently working with the perl CPAN module HTML::TreeBuilder. Is
>someone able to explain to me if this is the module I want to use to
>extract data from a HTML page. Now this HTML page is contained outside of
>my perl-cgi script. Fro

Re: HTML::Treebuilder

2001-12-20 Thread Michael Fowler
On Wed, Dec 19, 2001 at 05:46:15PM -0500, McCollum, Frank wrote:
> I do not understand what is meant by 'depth' in this module (i've read the
> accompanying documentation, but I didn't follow it well). Does anyone know
> where a good description is?

The HTML::Element documentation has an intro