Stefan Behnel <[EMAIL PROTECTED]> wrote:
> Stefan Scholl wrote:
>> Stefan Behnel <[EMAIL PROTECTED]> wrote:
>>> Stefan Scholl wrote:
>>>> Well, http://docs.python.org/lib/module-xml.sax.html is missing
>>>> the fact, that I can't use Unicode with parseString().
>>>>
>>>> This parseString() uses cStringIO.
>>> Well, Python unicode is not a valid *byte* encoding for XML.
>>>
>>> lxml.etree can parse unicode, if you really want, but otherwise, you should
>>> maybe stick to well-formed XML.
>>
>> The XML is well-formed. Works perfect in Python 2.4 with Python
>> unicode and Python sax parser.
>
> The XML is *not* well-formed if you pass Python unicode instead of a byte
> encoded string. Read the XML spec.
>
> It would be well-formed if you added the proper XML declaration, but that is
> system specific (UCS-4 or UTF-16, BE or LE). So don't even try.

Who cares? I'm not calling any external tools. Python should know its
own strings.
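
For reference, the workaround under discussion can be sketched as follows. This is only a minimal sketch, assuming the document carries no encoding declaration (so the parser falls back to UTF-8); the names TextCollector and parse_unicode are made up for illustration and are not part of xml.sax.

```python
import xml.sax


class TextCollector(xml.sax.ContentHandler):
    """Hypothetical handler that only gathers character data."""

    def __init__(self):
        xml.sax.ContentHandler.__init__(self)
        self.chunks = []

    def characters(self, content):
        # The parser may deliver text in several pieces, so collect them all.
        self.chunks.append(content)


def parse_unicode(text):
    # Assumption: `text` has no <?xml ... encoding=...?> declaration, so
    # UTF-8 bytes are what the parser expects by default.
    handler = TextCollector()
    xml.sax.parseString(text.encode("utf-8"), handler)
    return u"".join(handler.chunks)


result = parse_unicode(u"<greeting>Gr\u00fc\u00dfe</greeting>")
```

The encode step hands parseString() a byte string instead of a Python unicode object, which is the case the thread is arguing about. If the string did contain an encoding declaration naming something other than UTF-8, the bytes and the declaration would disagree, so this sketch would not apply as written.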