Dear list,

Apache Xerces-J says that it implements DOM Level 1 HTML.  I asked recently 
about the bootstrapping support, which did not yield answers, so let me broaden 
the question.

Can anyone point me towards some way of reading HTML (non 
XML<https://www.w3.org/TR/DOM-Level-1/introduction.html#ID-E7C3082>) files 
using Xerces-J? I tried various things using 
org.apache.xerces.parsers.DOMParserImpl but parsing this 
file<https://github.com/oliviercailloux/JARiS/blob/main/src/test/resources/io/github/oliviercailloux/jaris/xml/Html/Simple.html>
 for example (valid according to Nu validator<https://validator.nu/>) fails.

I am ready to use a way that does not follow the W3C bootstrapping one, if 
required.

Thanks a lot!

Reply via email to