Hi,
I've been running out of memory when reusing the same XMLReader (xercesImpl-2.8.0) to parse many large documents.
The documents all reference the same DTD, which in turn references many entities.
Profiling (with netbeans-5.0) shows that the memory is being consumed by char[]s allocated by:

org.apache.xerces.util.SymbolTable$Entry.<init>
  org.apache.xerces.util.SymbolTable.addSymbol()
    org.apache.xerces.impl.XMLEntityScanner.scanName()
      org.apache.xerces.impl.XMLDTDScannerImpl.scanEntityDecl()
...
Maybe it's storing the symbols for the same DTD again for each new document and never discarding them? Should it recognize a previously parsed DTD and reuse the existing symbol table?

I've worked around it by using a new XMLReader for each document.
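
For reference, the workaround looks roughly like the sketch below. It is a simplified stand-in rather than my actual code: the class and method names are made up for illustration, it obtains the reader through the JAXP SAXParserFactory instead of instantiating the Xerces parser directly, and it parses small inline strings instead of large files. The point is only that each document gets its own XMLReader, so nothing held by one reader can accumulate across documents.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;

// Illustrative name; not part of Xerces or of my real application.
class FreshReaderPerDocument {

    // Parses one document with a brand-new XMLReader and returns the
    // number of start tags seen. The reader becomes garbage once this
    // method returns, along with whatever state it built up.
    static int countElements(String xml) throws Exception {
        SAXParserFactory factory = SAXParserFactory.newInstance();
        XMLReader reader = factory.newSAXParser().getXMLReader();

        final int[] count = {0};
        reader.setContentHandler(new DefaultHandler() {
            @Override
            public void startElement(String uri, String localName,
                                     String qName, Attributes atts) {
                count[0]++;
            }
        });

        reader.parse(new InputSource(new StringReader(xml)));
        return count[0];
    }

    public static void main(String[] args) throws Exception {
        // Stand-ins for the many large documents.
        String[] documents = {
            "<doc><a/><b/></doc>",
            "<doc><a><b/></a><c/></doc>"
        };
        for (String xml : documents) {
            // A new reader per document instead of one shared reader.
            System.out.println(countElements(xml) + " elements");
        }
    }
}
```

The cost is that the parser is set up again for every document, which is why I'm asking about caching below.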

Is there a way to have DTDs and entities cached across documents to improve performance?
I'm using org.apache.xerces.util.XMLCatalogResolver.

Cheers,
   Neil.



