New submission from Maciek J <e...@wp.pl>: Not sure if this is a Python problem or an expat problem, but I get truncated data while parsing XML documents.
This particular project is for parsing an XML file of Wikipedia dump. The attached files are: * xml-parse-revisions.py - parser script * revision-test.xml - input XML * revision-test.xml.sql - output XML * revision_create.sql - not really needed for this test case, but attached for completeness You can notice that the output file sometimes contains too short values for the "timestamp". Also note that if you add whitespace to the input XML, then different timestamps will be truncated. My Python is 2.6.6. ---------- components: XML files: pyxml_error.zip messages: 119184 nosy: Maciek.J priority: normal severity: normal status: open title: Data truncation in expat parser versions: Python 2.6 Added file: http://bugs.python.org/file19292/pyxml_error.zip _______________________________________ Python tracker <rep...@bugs.python.org> <http://bugs.python.org/issue10149> _______________________________________ _______________________________________________ Python-bugs-list mailing list Unsubscribe: http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com