Re: [xml] Recovering from errors in an XML "stream"

Webb Scales Mon, 09 Sep 2019 21:30:47 -0700

I'm OK with making small on-the-fly "edits" to the input (such asremoving the initial comment, or removing all comments), but trying tomake my code discern the overall structure (such as picking out theboundaries between the documents) is starting to step over into actuallyparsing it, which defeats the purpose of using LibXML2.

If the TextReader didn't insist upon reading beyond the root end-tag,that would enable me to solve my problem, I think. (I don't understandwhy it does that.) In the absence of any other options, I'm going toexperiment with the SAX interface and see if that will allow me to stopthe parse at the right spot.


Anyway, thanks for your replies, Liam.


            Webb



On 9/10/19 12:19 AM, Liam R E Quin wrote:

On Mon, 2019-09-09 at 22:41 -0400, Webb Scales wrote:

the
fact remains that I don't control the text that I'm trying to parse,
and I still need to parse it, even though it's not "well-formed".

You may need to write some form of pre-processor that fixes the
problems. As you say, that may reduce the need for an XML parser.

I haven't investigated error recovery with libxml, so someone else
might have better ideas.

Liam


--

Webb Scales
Principal Software Architect
603-673-2306
www.ursasecure.com <https://www.ursasecure.com>
w...@ursasecure.com <mailto:w...@ursasecure.com>

_______________________________________________
xml mailing list, project page  http://xmlsoft.org/
xml@gnome.org
https://mail.gnome.org/mailman/listinfo/xml

Re: [xml] Recovering from errors in an XML "stream"

Reply via email to