In reading over the source for CPython's PyUnicode_EncodeDecimal, I see a dance to handle characters which are neither dec-equiv nor in Latin-1. Does anyone know about the intent of such a conversion?
As far as I can tell, error handling is one of: strict, replace, ignore, xmlcharrefreplace, or something_else What I don't understand is whether, in the ignore or something_else cases, there is any chance that digits will show up anywhere that they would not if these characters were treated as a character like '?'? Can someone either give me definitive "why not" or (preferably) give me a test case that shows where that interpretation does not hold. --Scott David Daniels [EMAIL PROTECTED] -- http://mail.python.org/mailman/listinfo/python-list