"Eric J. Schwarzenbach" <[EMAIL PROTECTED]> wrote on 
12/27/2006 02:03:28 PM:

> Isn't a billion characters far more that you could ever load into a DOM
> in any practical environment?

In general yes. However, if you know in advance which regions of the 
document actually require random access and which parts of the document 
you will never visit it may be possible to keep the memory usage low by 
using a filter [1].

> The question I find more interesting that of the feasibility of VTD as
> general DOM / Xerces competitor, is could some of the underlying
> techniques [1] they use in processing the XML be used to make a more
> performant and scalable DOM parser? In their FAQ they suggest no [2],
> but they seem to assume that the whole DOM object structure need be
> created and not just parts of it lazily as they are actually used.

I understand Apache AXIOM [2] (which is an XML object model used by Axis2) 
will lazily contstuct its object tree by pulling document events from the 
parser as needed. There's no reason you couldn't do the same (or even 
better) with DOM.

> Eric

[1] 
http://www.w3.org/TR/2004/REC-DOM-Level-3-LS-20040407/load-save.html#LS-LSParserFilter
[2] http://ws.apache.org/commons/axiom/index.html

Michael Glavassevich
XML Parser Development
IBM Toronto Lab
E-mail: [EMAIL PROTECTED]
E-mail: [EMAIL PROTECTED]

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to