You can try HTML::FormatText for this. It doesn't support tables and forms,
though.
##########################
require HTML::TreeBuilder;
$tree = HTML::TreeBuilder->new->parse_file("test.html");
require HTML::FormatText;
$formatter = HTML::FormatText->new(leftmargin => 0, rightmargin => 50);
print $formatter->format($tree);
##########################
Scot Robnett
inSite Internet Solutions
[EMAIL PROTECTED]
http://www.insiteful.tv
-----Original Message-----
From: Octavian Rasnita [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, July 17, 2002 2:55 AM
To: [EMAIL PROTECTED]
Subject: HTML tags - module
Hi all,
I want to get a web page and remove all the HTML tags from it, then save the
visible text only. Like saving the file as text from Internet Explorer.
Do you know a Perl module that can help me to find and remove all the HTML
tags?
I was thinking to use regular expressions, but I may forget a lot of things.
Thank you.
Teddy Center: http://teddy.fcc.ro/
Mail: [EMAIL PROTECTED]
--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]