>>>>> "Scott" == "Scott Taylor" <[EMAIL PROTECTED]> writes:

Scott> my simple input file will look like this:

Scott> <HTML>
Scott> <HEAD>
Scott>   <TITLE>Bin Server</TITLE>
Scott> </HEAD>
Scott> <BODY>
Scott> <p>Data that I need</p>
Scott> <p>Data that I need</p>
Scott> </BODY>
Scott> </HTML>

Scott> I want the output to just be lines of "Data that I need" stored in a
Scott> string, that I can work on each line one at a time, or in an array or
Scott> something like that would be great.

Your specification is incomplete.

What if it says:

        <p>Data that I <b>need</b></p>

Do you want <b> in your response?  Or stripped?  Or that part of it
not included?

And why is <p> interesting to you, but not <title>?  those are both "text".
You'll need to explain it by more than just one example.  What
if it's in a table?  What if it's the caption for an image?

And this is also why there are very few "off the shelf" solutions.  Everyone
has *different* requirements.  And yes, HTML::Parser is one of the best
ways to precisely define what you want, and then you can get it.

-- 
Randal L. Schwartz - Stonehenge Consulting Services, Inc. - +1 503 777 0095
<merlyn@stonehenge.com> <URL:http://www.stonehenge.com/merlyn/>
Perl/Unix/security consulting, Technical writing, Comedy, etc. etc.
See PerlTraining.Stonehenge.com for onsite and open-enrollment Perl training!

-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
<http://learn.perl.org/> <http://learn.perl.org/first-response>


Reply via email to