If end tags are missing, the data simply isn't XML and you shouldn't expect
XML tools to handle it. Part of the point of moving from SGML to XML was
precisely to drive folks toward writing well-formed documents rather than
trying to guess past their errors.

If it's HTML (which is based on SGML), you may want to look at the NekoHTML
parser, which tolerates some sloppiness, or the W3C's "Tidy" tool which
attempts to recover from much more.

If it's something else... Frankly, the right answer here is to fix the file
before it reaches the XML parser, either by fixing whatever generates it or
by running a preprocessor of some sort.

______________________________________
"... Three things see no end: A loop with exit code done wrong,
A semaphore untested, And the change that comes along. ..."
  -- "Threes" Rev 1.1 - Duane Elms / Leslie Fish
(http://www.ovff.org/pegasus/songs/threes-rev-11.html)

Reply via email to