hi! I am working with InDesign exported xml and parse it in a python application. I learned here: http://boodebr.org/main/python/all-about-python-and-unicode that there actually are sets of illegal unicode characters for xml (and henceforth for every compliant xml parser). I already implemented a regex solution to replace the characters in question, but I wonder if there is a efficient and out-of-the-box solution somewhere out there for this problem. does anybody know?
thanks! gabriel -- http://mail.python.org/mailman/listinfo/python-list