Re: Problem with parsing HTML

2012-05-12 Thread Yizhou Z.
NekoHTML parser uses Xerces' HTML DOM implementation. And it seems that it can always return the appropriate HTML DOM element objects for other types of element nodes. But for , I found it returns an object of type "org.apache.xerces.dom.ElementNSImpl". I wonder if this is a bug in the version of

Re: Problem with parsing HTML

2012-05-12 Thread Michael Glavassevich
Have you tried setting the 'document-class-name' property [1] so that it points to Xerces' HTML DOM implementation? Thanks. [1] http://xerces.apache.org/xerces2-j/properties.html#dom.document-class-name Michael Glavassevich XML Technologies and WAS Development IBM Toronto Lab E-mail: mrgla...@