On Friday, April 5, 2002, at 10:43 , Paul Tremblay wrote:
[..]
> The problem is that the filter deletes all of my text and ouputs this:
>
> [TABLE NOT SHOWN][TABLE NOT SHOWN][TABLE NOT SHOWN][TABLE NOT
> SHOWN][TABLE NOT SHOWN]
Right! that is the big clue I should have seen - there is no
'plain
On Fri, Apr 05, 2002 at 05:15:08AM -0800, drieux wrote:
>
> ### #!/usr/bin/perl
> ###
> ### use HTML::Parser;
> ### use HTML::FormatText;
> ### use HTML::TreeBuilder;
> ###
> ### my $html_text;
> ### my $filename = $ARGV[0];
> ### open(FH, $filename) or die "unable to open file $filename :$!\n";
On Thursday, April 4, 2002, at 01:19 , murphy, daniel (BMC Eng) wrote:
> Just did this with the help of "Perl Cookbook" (this book is great).
>
> Chapter 20.6 Extracting or Removing HTML tags
>
> use HTML::Parse;
> use HTML::FormatText;
> $plain_text = HTML::FormatText->new->format(parse_html($h
On Thu, Apr 04, 2002 at 10:36:36AM -0800, Agustin Rivera wrote:
>
> Are you looking to keep the basic formatting of the HTML in tact during the
> conversion, or just want the HTML stripped? I wouldn't imagine that it
> would be that hard to convert the HTML to text if the HTML wasn't overly
> c
On Thursday, April 4, 2002, at 12:12 , tom poe wrote:
[..]
>>
>> That's what the search engine is for
>>
>> http://search.cpan.org/search?dist=HTML-Format
>>
>> e.
worth remembering is also that this will require
the Font-AFM distribution.
ciao
drieux
---
--
To unsubscribe, e-mail: [EMAIL P
Just did this with the help of "Perl Cookbook" (this book is great).
Chapter 20.6 Extracting or Removing HTML tags
use HTML::Parse;
use HTML::FormatText;
$plain_text = HTML::FormatText->new->format(parse_html($html_text));
Dan Murphy [EMAIL PROTECTED]
EMC Corp.
Hi, elaine: Did you skip a step? How do we get from convert HTML to
HTML-Format / distribution? Now, that secret is as good as the one about
Life?!
Thanks,
Tom
On Thursday 04 April 2002 10:52, Elaine -HFB- Ashton wrote:
> Paul Tremblay [[EMAIL PROTECTED]] quoth:
> *>
> *>I am wodering if
Paul Tremblay [[EMAIL PROTECTED]] quoth:
*>
*>I am wodering if there isn't a CPAN module already written.
*>Converting html to text seems like such a common task, that there
*>ought to be some robust scripts out there. Interestingly enough,
*>I found many scripts to convert html to rtf and LaTeX a
Are you looking to keep the basic formatting of the HTML in tact during the
conversion, or just want the HTML stripped? I wouldn't imagine that it
would be that hard to convert the HTML to text if the HTML wasn't overly
complicated.
Agustin Rivera
Webmaster, Pollstar.com
http://www.pollstar.com