[issue12008] HtmlParser non-strict goes wrong with unquoted attributes

2011-11-01 Thread Ezio Melotti
Ezio Melotti added the comment: This seems to be already fixed in 3.2/3.3, so I extracted the test from your script and added it to the test suite. If you can find a way to break the parser, let me know. -- assignee: -> ezio.melotti nosy: +ezio.melotti resolution: -> out of date stage:
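For readers who want to check the behaviour on their own interpreter, here is a minimal sketch of such a check. It is not the test that was added to the CPython test suite; the class name TagCollector, the helper collect_start_tags, and the sample markup are all made up for illustration.

```python
from html.parser import HTMLParser


class TagCollector(HTMLParser):
    """Hypothetical helper: records every start tag with its attributes."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        self.tags.append((tag, attrs))


def collect_start_tags(markup):
    # Feed the whole document and return the (tag, attrs) pairs seen.
    parser = TagCollector()
    parser.feed(markup)
    parser.close()
    return parser.tags


# Unquoted attribute values followed by further tags (illustrative input).
print(collect_start_tags('<a href=foo>text</a><b class=bar>more</b>'))
```

If the second tag were reported as plain data instead of a start tag, the collected list would be missing the 'b' entry, which is the symptom described in this issue.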

[issue12008] HtmlParser non-strict goes wrong with unquoted attributes

2011-11-01 Thread Roundup Robot
Roundup Robot added the comment: New changeset 6107a84e3c44 by Ezio Melotti in branch '3.2': #12008: add a test. http://hg.python.org/cpython/rev/6107a84e3c44 New changeset 495b31a8b280 by Ezio Melotti in branch 'default': #12008: merge with 3.2. http://hg.python.org/cpython/rev/495b31a8b280

[issue12008] HtmlParser non-strict goes wrong with unquoted attributes

2011-09-03 Thread Piet van Oostrum
Piet van Oostrum added the comment: I was bitten by this bug today. Hope it will be solved in the next release of Python 3. It is also possible to use the third argument of search in line 285: m = attrfind_tolerant.search(rawdata, k, endpos) This seems to me to be a more `nat
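The effect of the third argument to a compiled pattern's search method can be sketched as follows. The regular expression here is a simplified stand-in, not the real attrfind_tolerant from html.parser, and the function find_attrs and the sample rawdata are assumptions made for illustration only.

```python
import re

# Simplified, hypothetical attribute pattern: name=value with an unquoted value.
attr_pattern = re.compile(r'([a-zA-Z_][-.:a-zA-Z0-9_]*)\s*=\s*([^\s>]*)')


def find_attrs(rawdata, k, endpos=None):
    """Collect (name, value) pairs starting at index k.

    When endpos is given, the search behaves as if rawdata ended there,
    so matches cannot run past the end of the current tag.
    """
    if endpos is None:
        endpos = len(rawdata)
    attrs = []
    while True:
        m = attr_pattern.search(rawdata, k, endpos)
        if m is None:
            break
        attrs.append((m.group(1), m.group(2)))
        k = m.end()
    return attrs


rawdata = '<a href=foo>text<b class=bar>'
endpos = rawdata.index('>')  # end of the first start tag

print(find_attrs(rawdata, 2))          # unbounded: also picks up the next tag's attribute
print(find_attrs(rawdata, 2, endpos))  # bounded: stops at the end of the first tag
```

Without the bound, the loop keeps matching into the following tag; with endpos it returns only the attributes of the tag being processed.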

[issue12008] HtmlParser non-strict goes wrong with unquoted attributes

2011-05-06 Thread Éric Araujo
Changes by Éric Araujo: -- nosy: +eric.araujo, r.david.murray versions: +Python 3.3

[issue12008] HtmlParser non-strict goes wrong with unquoted attributes

2011-05-05 Thread svilen dobrev
svilen dobrev added the comment: (the nonstrict regexp came with Issue1046092)

[issue12008] HtmlParser non-strict goes wrong with unquoted attributes

2011-05-05 Thread svilen dobrev
Changes by svilen dobrev: -- components: +Library (Lib) type: -> behavior versions: +Python 3.2 Added file: http://bugs.python.org/file21894/test-htmlparser-attrs.py

[issue12008] HtmlParser non-strict goes wrong with unquoted attributes

2011-05-05 Thread svilen dobrev
New submission from svilen dobrev: Non-strict mode seems to consume too much data and run past the endpos of the chunk being processed; the parser then gets confused and treats everything that follows as data. I didn't work out how to fix the regexp as such, but instead limited its span to :endpos so it does