Gidday All,
I am writing a print this page script.
I have slurped in the page to be printed and now want to strip out the stuff to print.
To do this I have created the following tag sets in the html page.
<date></date> <headline></headline> <story></story>
I need to write a regex to achieve this.
In general attempting to write regexes to parse HTML is a bad idea. Now having said that...
Assuming you are only after these three sets of tags and they have no attributes, and I am assuming they have some content that you want to grab, you can start out simple and work your way up from there....
/<date>(.*)<\/date>/
This has a number of problems, but might work. What have you tried, what does an actual example of the data look like, and how are you handling the actual data (aka looping using a while <FH>? foreach over a split, etc.)??
Give us more, and we will reply in kind....
http://danconia.org
-- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]