For various reasons I am using HTMLParser to do transcription of some xml. I need to keep charrefs as is so for Python > 3.4 I pass in

convert_charrefs =False

to the constructor.

This seems to work as intended for data, but I notice that a change in Python 3.4 prevents me from keeping the charrefs which are in attribute strings.

Is it intentional that we can no longer use HTMLParser.unescape? It seems to prevent correct interpretation of the convert_charrefs constructor argument.

The unescaping is done, but in a module level function which means I can no longer override that functionality safely.
--
Robin Becker

--
https://mail.python.org/mailman/listinfo/python-list

Reply via email to