Re: trying to parse non valid html documents with HTMLParser

florent Wed, 03 Aug 2005 02:45:38 -0700

> AFAIK not with HTMLParser or htmllib. You might try (if you haven't done
> yet) htmllib and see, which parser is more forgiving.


Thanks, I'll try htmllib.
In other case, I found a solution. Feeding data to the HTMLParser by 
chunks extracted from the string using string.split("<"), will allow me 
to loose only one tag at a time when an exception is raised !
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: trying to parse non valid html documents with HTMLParser

Reply via email to