On Fri, 2002-09-27 at 16:53, John Holmes wrote: This isn't accurate enough because <DT> is not always preceeded by: <DD> some text. It is sometimes preceeded by <DT> some text </DT> or other items.
This expression matches fairly well: <DD>[a-zA-Z0-9\.,'\-\s]* So it matches up to the <DT>: <DD> A whole bunch of text <DT> Something else </DT> But I don't know how to do the replace part. So essentially I need find this: <DD> A whole bunch of text and replace it with this: <DD> A whole bunch of text </DD> > > $new_text = str_replace("\n\t<DT>","</DD>\n\t<DT>",$your_text); > > Assuming there is a tab in there... > > You'd be better off getting an editor that'll do this for you. Textpad > lets you search and replace with regular expressions. Other programs do > too. > > ---John Holmes... > > > I have a fairly large html document that I need to convert to xml. > > The current format is is: > > <DD> A whole bunch of text > > <DT> Something else </DT> > > (There is a new line in there before <DT>) > > > > Which I need to convert to > > <DD> A whole bunch of text </DD> > > <DT> Something else </DT> -- PHP General Mailing List (http://www.php.net/) To unsubscribe, visit: http://www.php.net/unsub.php