New submission from Dan <negativ...@gmail.com>:

The attached HTML document (pulled from a Samsung printer web interface) 
contains the following invalid HTML tag:
<img style="vertical-align:bottom;" ,="" 
src="images/sws/icon_alert_warning_16.gif" title="Warning">
(invalid because of ,="")
In Python 3.x completely stops the HTML parser, preventing any further tags 
from being parsed. This does not happen in Python 2.x
See the attached Python script, which counts the number of "input" tags. When 
executed using Python 2.7, it correctly counts 4 such tags. When executed using 
Python 3.8 it only finds 1.

----------
components: Library (Lib)
files: testhtmlparse.zip
messages: 378101
nosy: dan
priority: normal
severity: normal
status: open
title: Regression in HTMLParser on malformed tags
type: behavior
versions: Python 3.8
Added file: https://bugs.python.org/file49497/testhtmlparse.zip

_______________________________________
Python tracker <rep...@bugs.python.org>
<https://bugs.python.org/issue41956>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

Reply via email to