You can try HTML::FormatText for this. It doesn't support tables and forms,
though.

##########################

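use strict;
use warnings;
use HTML::TreeBuilder;
use HTML::FormatText;

# Parse the HTML file into a tree; parse_file returns undef if the file can't be opened.
my $tree = HTML::TreeBuilder->new->parse_file("test.html")
    or die "Cannot parse test.html: $!";

# Render the tree as plain text, wrapped at column 50.
my $formatter = HTML::FormatText->new(leftmargin => 0, rightmargin => 50);
print $formatter->format($tree);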

##########################
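
Since the original question was about fetching a page and saving the text, here is a rough sketch of how the same formatter could be wrapped up for that. The helper name html_to_text, the example URL, and the output file name page.txt are placeholders of mine, and the LWP::Simple part is left commented out because it is only one possible way to fetch the page.

```perl
use strict;
use warnings;
use HTML::TreeBuilder;
use HTML::FormatText;

# Hypothetical helper (name is an assumption): convert an HTML string
# to plain text wrapped at column 50.
sub html_to_text {
    my ($html) = @_;
    my $tree = HTML::TreeBuilder->new_from_content($html);
    my $formatter = HTML::FormatText->new(leftmargin => 0, rightmargin => 50);
    my $text = $formatter->format($tree);
    $tree->delete;    # release the parse tree
    return $text;
}

# Possible use with a placeholder URL and output file name:
#   use LWP::Simple qw(get);
#   my $html = get("http://www.example.com/") or die "Could not fetch page";
#   open my $out, '>', "page.txt" or die "Cannot write page.txt: $!";
#   print {$out} html_to_text($html);
#   close $out;
```

Keeping the conversion in its own sub means the same code works whether the HTML comes from a file, from LWP, or from somewhere else.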

Scot Robnett
inSite Internet Solutions
[EMAIL PROTECTED]
http://www.insiteful.tv





-----Original Message-----
From: Octavian Rasnita [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, July 17, 2002 2:55 AM
To: [EMAIL PROTECTED]
Subject: HTML tags - module


Hi all,

I want to get a web page, remove all the HTML tags from it, and save only the
visible text, like saving the page as text from Internet Explorer.

Do you know a Perl module that can help me find and remove all the HTML
tags?
I was thinking of using regular expressions, but I might miss a lot of cases.

Thank you.

Teddy Center: http://teddy.fcc.ro/
Mail: [EMAIL PROTECTED]



--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

