Re: HTML to Text

2006-05-03 Thread JupiterHost.Net
And why not post an example of your catch to illustrate it for the benefit of the list? Because I was busy and I knew you would do it ;-) Hee hee, yeah true enough :) But if you know "this exact block of HTML", how about: my @strings = ( "string 1", "string 2", ... ); Because most likele

Re: HTML to Text

2006-05-03 Thread Paul Johnson
On Wed, May 03, 2006 at 10:04:26AM -0500, JupiterHost.Net wrote: > Paul Johnson wrote: > >On Tue, May 02, 2006 at 04:43:34PM -0500, JupiterHost.Net wrote: > > > >>>Basically, right now I just need the HTML to Text output, like I > >>>explained. > > &

Re: HTML to Text

2006-05-03 Thread JupiterHost.Net
Paul Johnson wrote: On Tue, May 02, 2006 at 04:43:34PM -0500, JupiterHost.Net wrote: Basically, right now I just need the HTML to Text output, like I explained. "I want to grab strings between the p tags in this exact block of HTML" to which I would reply: my @strings =

Re: HTML to Text

2006-05-02 Thread Paul Johnson
On Tue, May 02, 2006 at 04:43:34PM -0500, JupiterHost.Net wrote: > >Basically, right now I just need the HTML to Text output, like I explained. > "I want to grab strings between the p tags in this exact block of HTML" > > to which I would reply: > > my @str

Re: HTML to Text

2006-05-02 Thread JupiterHost.Net
it not included? I don't care. ... And why is interesting to you, but not ? those are both "text". You'll need to explain it by more than just one example. What if it's in a table? What if it's the caption for an image? ... Basically, right now I just need

Re: HTML to Text

2006-05-02 Thread Scott Taylor
om a robot, it will always look like that, and at this time, I am only interested in the lines between the not saying I might not be interested in other stuff later on, or doing the same thing with another HTML some day down the road. Either way, an answer to the current question will most li

RE: HTML to Text

2006-05-02 Thread Scott Taylor
On Tue, May 2, 2006 13:45, Russ Foster wrote: >> -Original Message- >> >> >> >> Bin Server >> >> >> Data that I need >> Data that I need >> >> >> >> I want the output to just be lines of "Data that I need" stored in a >> string, that I can work on each line one at a time, or in an

RE: HTML to Text

2006-05-02 Thread Russ Foster
> -Original Message- > > > > Bin Server > > > Data that I need > Data that I need > > > > I want the output to just be lines of "Data that I need" stored in a > string, that I can work on each line one at a time, or in an array or > something like that would be great. I would fir

Re: HTML to Text

2006-05-02 Thread Randal L. Schwartz
> "Scott" == "Scott Taylor" <[EMAIL PROTECTED]> writes: Scott> my simple input file will look like this: Scott> Scott> Scott> Bin Server Scott> Scott> Scott> Data that I need Scott> Data that I need Scott> Scott> Scott> I want the output to just be lines of "Data that I need" stored

HTML to Text

2006-05-02 Thread Scott Taylor
Hello all, I have an HTML file that I need to extract the text data from. What modules do I need to be able to do this? I was looking at HTML::Parser but it doesn't make any sense and probably not what I'm looking for. my simple input file will look like this: Bin Server Data that I nee

Re: converting html to text

2002-04-06 Thread drieux
On Friday, April 5, 2002, at 10:43 , Paul Tremblay wrote: [..] > The problem is that the filter deletes all of my text and ouputs this: > > [TABLE NOT SHOWN][TABLE NOT SHOWN][TABLE NOT SHOWN][TABLE NOT > SHOWN][TABLE NOT SHOWN] Right! that is the big clue I should have seen - there is no 'plain

Re: converting html to text

2002-04-05 Thread Paul Tremblay
On Fri, Apr 05, 2002 at 05:15:08AM -0800, drieux wrote: > > ### #!/usr/bin/perl > ### > ### use HTML::Parser; > ### use HTML::FormatText; > ### use HTML::TreeBuilder; > ### > ### my $html_text; > ### my $filename = $ARGV[0]; > ### open(FH, $filename) or die "unable to open file $filename :$!\n";

Re: converting html to text

2002-04-05 Thread drieux
On Thursday, April 4, 2002, at 01:19 , murphy, daniel (BMC Eng) wrote: > Just did this with the help of "Perl Cookbook" (this book is great). > > Chapter 20.6 Extracting or Removing HTML tags > > use HTML::Parse; > use HTML::FormatText; > $plain_text = HTML::FormatText->new->format(parse_html($h

Re: converting html to text

2002-04-04 Thread Paul Tremblay
On Thu, Apr 04, 2002 at 10:36:36AM -0800, Agustin Rivera wrote: > > Are you looking to keep the basic formatting of the HTML in tact during the > conversion, or just want the HTML stripped? I wouldn't imagine that it > would be that hard to convert the HTML to text if the H

Re: converting html to text

2002-04-04 Thread drieux
On Thursday, April 4, 2002, at 12:12 , tom poe wrote: [..] >> >> That's what the search engine is for >> >> http://search.cpan.org/search?dist=HTML-Format >> >> e. worth remembering is also that this will require the Font-AFM distribution. ciao drieux --- -- To unsubscribe, e-mail: [EMAIL P

RE: converting html to text

2002-04-04 Thread murphy, daniel (BMC Eng)
TED] EMC Corp.508-249-3322 Hopkinton, MA 01748 EMC² where information lives -Original Message- From: Paul Tremblay [mailto:[EMAIL PROTECTED]] Sent: Thursday, April 04, 2002 12:11 PM To: [EMAIL PROTECTED] Subject: converting html to text

Re: converting html to text

2002-04-04 Thread tom poe
am wodering if there isn't a CPAN module already written. > *>Converting html to text seems like such a common task, that there > *>ought to be some robust scripts out there. Interestingly enough, > *>I found many scripts to convert html to rtf and LaTeX and every &g

Re: converting html to text

2002-04-04 Thread Elaine -HFB- Ashton
Paul Tremblay [[EMAIL PROTECTED]] quoth: *> *>I am wodering if there isn't a CPAN module already written. *>Converting html to text seems like such a common task, that there *>ought to be some robust scripts out there. Interestingly enough, *>I found many scripts to convert h

Re: converting html to text

2002-04-04 Thread Agustin Rivera
Are you looking to keep the basic formatting of the HTML in tact during the conversion, or just want the HTML stripped? I wouldn't imagine that it would be that hard to convert the HTML to text if the HTML wasn't overly complicated. Agustin Rivera Webmaster, Pollstar.com http://www.po

converting html to text

2002-04-04 Thread Paul Tremblay
CPAN module already written. Converting html to text seems like such a common task, that there ought to be some robust scripts out there. Interestingly enough, I found many scripts to convert html to rtf and LaTeX and every other format, but not plain old text! Paul --

Converting HTML to text?

2002-01-06 Thread Andy Ransom
Hi, I have a requirement to convert HTML files to plain text within a perl script, and I need to preserve the formating of HTML table as far as possible, say something like Netscape does when you do a "save as text" operation. I have looked on CPAN but could not find anything appropriate (altho

Re: HTML to text

2001-12-20 Thread Curtis Poe
--- "McCollum, Frank" <[EMAIL PROTECTED]> wrote: > Does anyone know where to find resources on parsing textual content out of > an HTML page? I am trying to grab values out of a table on a website. I > can grab the table, but I am having trouble grabbing the actual values from > it... Frank, Y

HTML to text

2001-12-20 Thread McCollum, Frank
Does anyone know where to find resources on parsing textual content out of an HTML page? I am trying to grab values out of a table on a website. I can grab the table, but I am having trouble grabbing the actual values from it... Thanks Frank McCollum Bank Of America Securities, LLC [EMAIL PROT

HTML to Text

2001-10-04 Thread Sunthari
Dear List, I have used the WWW::Search module to make search for words and to get url and desription for the search. I want to use the HTML ::TreeBuider or HTML::Parser to scan thru the pages to that I could do a match for concordance from each webpage. How should I go abt it? The problem is, a