>>>>> "Scott" == "Scott Taylor" <[EMAIL PROTECTED]> writes:
Scott> my simple input file will look like this:
Scott> <HTML>
Scott> <HEAD>
Scott> <TITLE>Bin Server</TITLE>
Scott> </HEAD>
Scott> <BODY>
Scott> <p>Data that I need</p>
Scott> <p>Data that I need</p>
Scott> </BODY>
Scott> </HTML>
Scott> I want the output to just be lines of "Data that I need" stored in a
Scott> string, that I can work on each line one at a time, or in an array or
Scott> something like that would be great.
Your specification is incomplete.
What if it says:
<p>Data that I <b>need</b></p>
Do you want <b> in your response? Or stripped? Or that part of it
not included?
And why is <p> interesting to you, but not <title>? those are both "text".
You'll need to explain it by more than just one example. What
if it's in a table? What if it's the caption for an image?
And this is also why there are very few "off the shelf" solutions. Everyone
has *different* requirements. And yes, HTML::Parser is one of the best
ways to precisely define what you want, and then you can get it.
--
Randal L. Schwartz - Stonehenge Consulting Services, Inc. - +1 503 777 0095
<[email protected]> <URL:http://www.stonehenge.com/merlyn/>
Perl/Unix/security consulting, Technical writing, Comedy, etc. etc.
See PerlTraining.Stonehenge.com for onsite and open-enrollment Perl training!
--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
<http://learn.perl.org/> <http://learn.perl.org/first-response>