Hi, ID and NAME tokens must begin with a letter ([A-Za-z]) and may be followed by any number of letters, digits ([0-9]), hyphens ("-"), underscores ("_"), colons (":"), and periods ("."). [1]
Is there a encoder / decoder in Python that can convert arbitrary text to and from this encoding in an readable manner? base64 is not readable: In [31]: base64.encodestring("/TEST") Out[31]: 'L1RFU1Q=\n' quopri does not encode for example "/". In [25]: quopri.encodestring("/test") Out[25]: '/test' Anything else? -- Greg [1] http://www.w3.org/TR/html4/types.html#type-cdata -- http://mail.python.org/mailman/listinfo/python-list