htmlparser charrefs

Robin Becker Thu, 22 Dec 2016 05:47:27 -0800

For various reasons I am using HTMLParser to do transcription of some xml. Ineed to keep charrefs as is so for Python > 3.4 I pass in


convert_charrefs =False


to the constructor.

This seems to work as intended for data, but I notice that a change in Python3.4 prevents me from keeping the charrefs which are in attribute strings.

Is it intentional that we can no longer use HTMLParser.unescape? It seems toprevent correct interpretation of the convert_charrefs constructor argument.

The unescaping is done, but in a module level function which means I can nolonger override that functionality safely.

--
Robin Becker

--
https://mail.python.org/mailman/listinfo/python-list

htmlparser charrefs

Reply via email to