Dear list, Apache Xerces-J says that it implements DOM Level 1 HTML. I asked recently about the bootstrapping support, which did not yield answers, so let me broaden the question.
Can anyone point me to a way of reading HTML (non-XML<https://www.w3.org/TR/DOM-Level-1/introduction.html#ID-E7C3082>) files using Xerces-J? I tried various approaches with org.apache.xerces.parsers.DOMParserImpl, but parsing this file<https://github.com/oliviercailloux/JARiS/blob/main/src/test/resources/io/github/oliviercailloux/jaris/xml/Html/Simple.html>, for example (valid according to the Nu validator<https://validator.nu/>), fails. I am willing to use an approach that does not follow the W3C bootstrapping mechanism, if required. Thanks a lot!