Hi, I'd advocate using SAX: DOM-based methods load the complete XML document into memory, whereas SAX handles elements on the fly as they are read. SAX should therefore use much less memory, and is usually faster on large files.
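To make the SAX suggestion concrete, here is a minimal sketch using the standard library's xml.sax module. It assumes the elements are named page, title, revision and text with no XML namespaces, which may not match your file; the class name PageTextHandler and the function combine_pages are made up for illustration.

```python
import io
import xml.sax


class PageTextHandler(xml.sax.ContentHandler):
    """Collect title + revision text for each <page> element (assumed names)."""

    def __init__(self):
        super().__init__()
        self.path = []    # stack of currently open element names
        self.title = []   # character chunks seen inside page/title
        self.text = []    # character chunks seen inside page/revision/text
        self.pages = []   # one combined string per finished page

    def startElement(self, name, attrs):
        self.path.append(name)
        if name == "page":
            # A new page starts: reset the per-page buffers.
            self.title, self.text = [], []

    def characters(self, content):
        # characters() may be called several times for one text node,
        # so chunks are accumulated and joined later.
        if self.path[-2:] == ["page", "title"]:
            self.title.append(content)
        elif self.path[-3:] == ["page", "revision", "text"]:
            self.text.append(content)

    def endElement(self, name):
        if name == "page":
            self.pages.append("".join(self.title) + "\n" + "".join(self.text))
        self.path.pop()


def combine_pages(source):
    """Parse a filename or binary file object and return the combined strings."""
    handler = PageTextHandler()
    xml.sax.parse(source, handler)
    return handler.pages


if __name__ == "__main__":
    sample = (b"<mediawiki><page><title>A</title>"
              b"<revision><id>1</id><text>alpha</text></revision></page>"
              b"<page><title>B</title>"
              b"<revision><text>beta</text></revision></page></mediawiki>")
    for combined in combine_pages(io.BytesIO(sample)):
        print(combined)
```

The sketch keeps every result in a list for simplicity. For a really large file you would probably write each page out inside endElement instead, so that memory use stays flat.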
By the way, if your goal is just to "combine the text out of page:title and page:revision:text for every single page element", you might also consider an XSLT filter.

Regards,
Thierry
-- 
http://mail.python.org/mailman/listinfo/python-list