On Fri, 2002-09-27 at 16:53, John Holmes wrote:
This isn't accurate enough because <DT> is not always preceded by
<DD> some text. It is sometimes preceded by <DT> some text </DT> or
other items.

This expression matches fairly well:
<DD>[a-zA-Z0-9\.,'\-\s]*
It matches everything up to, but not including, the <DT>:
<DD> A whole bunch of text      
        <DT> Something else </DT>

But I don't know how to do the replace part.
So essentially I need to find this:
<DD> A whole bunch of text 

and replace it with this:
<DD> A whole bunch of text </DD>
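
One possible way to do the replace part in PHP is preg_replace() with a
capture group, so the matched text can be written back with </DD> appended.
This is only a sketch under some assumptions: close_dd() is a made-up helper
name, the sample string is modeled on the fragment above, and the character
class is the same one as in my expression, so any <DD> text containing other
characters (":" or "&", for example) would be skipped.

```php
<?php
// Sketch only: close_dd() is a hypothetical helper name, not a PHP built-in.
function close_dd($html)
{
    // Group 1:   "<DD>" plus the text after it (same character class as above).
    // [ \t]*     trailing spaces/tabs after the text; these are dropped.
    // Group 2:   the newline and indentation before the next tag; these are kept.
    // Lookahead: the next thing must be a tag other than "</DD>", so a <DD>
    //            that is already closed is left untouched.
    $pattern = '~(<DD>[a-zA-Z0-9.,\'\-\s]*?)[ \t]*(\s*)(?=<(?!/DD>))~';

    // Write group 1 back, add the closing tag, then restore the whitespace.
    return preg_replace($pattern, '$1 </DD>$2', $html);
}

// Assumed sample input: trailing spaces, a newline, then an indented <DT>.
$your_text = "<DD> A whole bunch of text      \n        <DT> Something else </DT>\n";
echo close_dd($your_text);
```

Because the lookahead requires a following tag, a <DD> at the very end of the
document with nothing after it would not be closed by this pattern and would
need to be handled separately.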
 
> 
> $new_text = str_replace("\n\t<DT>","</DD>\n\t<DT>",$your_text);
> 
> Assuming there is a tab in there...
> 
> You'd be better off getting an editor that'll do this for you. Textpad
> lets you search and replace with regular expressions. Other programs do
> too.
> 
> ---John Holmes...
> 
> > I have a fairly large html document that I need to convert to xml.
> > The current format is:
> > <DD> A whole bunch of text
> >     <DT> Something else </DT>
> > (There is a new line in there before <DT>)
> > 
> > Which I need to convert to
> > <DD> A whole bunch of text </DD>
> >     <DT> Something else </DT>


-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php
