Fredrik Lundh wrote:
> both ElementTree and cElementTree support "sax-style" event generation
> (through XMLTreeBuilder/XMLParser) and incremental parsing (through
> iterparse). the cElementTree versions of these are even faster than
> pyexpat.
>
> the iterparse interface is described here:
>
> http://effbot.org/zone/element-iterparse.htm
>

That's cool! Thanks for the info!
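
For anyone following along, here is a minimal sketch of what the incremental approach might look like. It uses xml.etree.ElementTree from the standard library, which provides the same iterparse interface as cElementTree. The function name count_records, the tag name "record", and the sample document are all made up for illustration; a real multi-gigabyte file would be passed as a filename or an open file object instead.

```python
import io
import xml.etree.ElementTree as ET


def count_records(source, tag="record"):
    """Stream through an XML document and count <tag> elements.

    `source` may be a filename or a file-like object opened in binary mode.
    The tag name "record" is a hypothetical example, not from any real schema.
    """
    count = 0
    # "end" events fire once an element has been read completely.
    for event, elem in ET.iterparse(source, events=("end",)):
        if elem.tag == tag:
            count += 1
            # Drop the element's children, text, and attributes once we are
            # done with it, so the in-memory tree does not keep growing.
            elem.clear()
    return count


# Tiny in-memory stand-in for a large file (illustrative data only).
sample = io.BytesIO(b"<root><record>a</record><record>b</record><other/></root>")
print(count_records(sample))
```

Note that the loop body runs in Python once per element, which is exactly the per-element cost I am worried about below.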
For a multi-gigabyte file, I would still recommend C/C++, because the processing code that sits on top of the XML library would still have to be written in Python, and that could turn out to be a significant overhead in such extreme cases. Of course, the exact strategy would depend on the specifics of the case, and all this speculation may not really apply! :)

Regards
Sreeram
-- http://mail.python.org/mailman/listinfo/python-list