And why not post an example of your catch to illustrate it for the
benefit of the list?
Because I was busy and I knew you would do it ;-)
Hee hee, yeah true enough :)
But if you know "this exact block of HTML", how about:
my @strings = ( "string 1", "string 2", ... );
Because most likele
On Wed, May 03, 2006 at 10:04:26AM -0500, JupiterHost.Net wrote:
> Paul Johnson wrote:
> >On Tue, May 02, 2006 at 04:43:34PM -0500, JupiterHost.Net wrote:
> >
> >>>Basically, right now I just need the HTML to Text output, like I
> >>>explained.
> >
&
Paul Johnson wrote:
On Tue, May 02, 2006 at 04:43:34PM -0500, JupiterHost.Net wrote:
Basically, right now I just need the HTML to Text output, like I explained.
"I want to grab strings between the p tags in this exact block of HTML"
to which I would reply:
my @strings =
On Tue, May 02, 2006 at 04:43:34PM -0500, JupiterHost.Net wrote:
> >Basically, right now I just need the HTML to Text output, like I explained.
> "I want to grab strings between the p tags in this exact block of HTML"
>
> to which I would reply:
>
> my @str
it not
included?
I don't care.
...
And why is interesting to you, but not ? those are both "text".
You'll need to explain it by more than just one example. What
if it's in a table? What if it's the caption for an image?
...
Basically, right now I just need
om a robot, it
will always look like that, and at this time, I am only interested in the
lines between the not saying I might not be interested in other stuff
later on, or doing the same thing with another HTML some day down the
road. Either way, an answer to the current question will most li
On Tue, May 2, 2006 13:45, Russ Foster wrote:
>> -Original Message-
>>
>>
>>
>> Bin Server
>>
>>
>> Data that I need
>> Data that I need
>>
>>
>>
>> I want the output to just be lines of "Data that I need" stored in a
>> string, that I can work on each line one at a time, or in an
> -Original Message-
>
>
>
> Bin Server
>
>
> Data that I need
> Data that I need
>
>
>
> I want the output to just be lines of "Data that I need" stored in a
> string, that I can work on each line one at a time, or in an array or
> something like that would be great.
I would fir
> "Scott" == "Scott Taylor" <[EMAIL PROTECTED]> writes:
Scott> my simple input file will look like this:
Scott>
Scott>
Scott> Bin Server
Scott>
Scott>
Scott> Data that I need
Scott> Data that I need
Scott>
Scott>
Scott> I want the output to just be lines of "Data that I need" stored
Hello all,
I have an HTML file that I need to extract the text data from. What
modules do I need to be able to do this?
I was looking at HTML::Parser but it doesn't make any sense and probably
not what I'm looking for.
my simple input file will look like this:
Bin Server
Data that I nee
On Friday, April 5, 2002, at 10:43 , Paul Tremblay wrote:
[..]
> The problem is that the filter deletes all of my text and ouputs this:
>
> [TABLE NOT SHOWN][TABLE NOT SHOWN][TABLE NOT SHOWN][TABLE NOT
> SHOWN][TABLE NOT SHOWN]
Right! that is the big clue I should have seen - there is no
'plain
On Fri, Apr 05, 2002 at 05:15:08AM -0800, drieux wrote:
>
> ### #!/usr/bin/perl
> ###
> ### use HTML::Parser;
> ### use HTML::FormatText;
> ### use HTML::TreeBuilder;
> ###
> ### my $html_text;
> ### my $filename = $ARGV[0];
> ### open(FH, $filename) or die "unable to open file $filename :$!\n";
On Thursday, April 4, 2002, at 01:19 , murphy, daniel (BMC Eng) wrote:
> Just did this with the help of "Perl Cookbook" (this book is great).
>
> Chapter 20.6 Extracting or Removing HTML tags
>
> use HTML::Parse;
> use HTML::FormatText;
> $plain_text = HTML::FormatText->new->format(parse_html($h
On Thu, Apr 04, 2002 at 10:36:36AM -0800, Agustin Rivera wrote:
>
> Are you looking to keep the basic formatting of the HTML in tact during the
> conversion, or just want the HTML stripped? I wouldn't imagine that it
> would be that hard to convert the HTML to text if the H
On Thursday, April 4, 2002, at 12:12 , tom poe wrote:
[..]
>>
>> That's what the search engine is for
>>
>> http://search.cpan.org/search?dist=HTML-Format
>>
>> e.
worth remembering is also that this will require
the Font-AFM distribution.
ciao
drieux
---
--
To unsubscribe, e-mail: [EMAIL P
TED]
EMC Corp.508-249-3322
Hopkinton, MA 01748
EMC²
where information lives
-Original Message-
From: Paul Tremblay [mailto:[EMAIL PROTECTED]]
Sent: Thursday, April 04, 2002 12:11 PM
To: [EMAIL PROTECTED]
Subject: converting html to text
am wodering if there isn't a CPAN module already written.
> *>Converting html to text seems like such a common task, that there
> *>ought to be some robust scripts out there. Interestingly enough,
> *>I found many scripts to convert html to rtf and LaTeX and every
&g
Paul Tremblay [[EMAIL PROTECTED]] quoth:
*>
*>I am wodering if there isn't a CPAN module already written.
*>Converting html to text seems like such a common task, that there
*>ought to be some robust scripts out there. Interestingly enough,
*>I found many scripts to convert h
Are you looking to keep the basic formatting of the HTML in tact during the
conversion, or just want the HTML stripped? I wouldn't imagine that it
would be that hard to convert the HTML to text if the HTML wasn't overly
complicated.
Agustin Rivera
Webmaster, Pollstar.com
http://www.po
CPAN module already written.
Converting html to text seems like such a common task, that there
ought to be some robust scripts out there. Interestingly enough,
I found many scripts to convert html to rtf and LaTeX and every
other format, but not plain old text!
Paul
--
Hi,
I have a requirement to convert HTML files to plain text within a perl
script, and I need to preserve the formating of HTML table as far as
possible, say something like Netscape does when you do a "save as text"
operation.
I have looked on CPAN but could not find anything appropriate (altho
--- "McCollum, Frank" <[EMAIL PROTECTED]> wrote:
> Does anyone know where to find resources on parsing textual content out of
> an HTML page? I am trying to grab values out of a table on a website. I
> can grab the table, but I am having trouble grabbing the actual values from
> it...
Frank,
Y
Does anyone know where to find resources on parsing textual content out of
an HTML page? I am trying to grab values out of a table on a website. I
can grab the table, but I am having trouble grabbing the actual values from
it...
Thanks
Frank McCollum
Bank Of America Securities, LLC
[EMAIL PROT
Dear List,
I have used the WWW::Search module to make search for
words and to get url and desription for the search.
I want to use the HTML ::TreeBuider or HTML::Parser to
scan thru the pages to that I could do a match for
concordance from each webpage. How should I go abt it?
The problem is, a
24 matches
Mail list logo