Re: Parsing HTML with JavaScript

Richard Brodie Fri, 13 May 2005 02:00:06 -0700

<[EMAIL PROTECTED]> wrote in message news:[EMAIL PROTECTED]


> I am trying to extract some information from a few web pages, and I was
> using the HTMLParser module. It worked fine until it got to the
> javascript, at which it gave a parse error.

It's fairly common for pages with Javascript to also be invalid HTML.
HTMLParser isn't an 'ignore all errors silently and guess what it's
meant to be' parser. Unless you have known good inputs it's often
best to use an alternative. Some options are discussed in Uche's article
here: http://www.xml.com/pub/a/2004/09/08/pyxml.html


-- 
http://mail.python.org/mailman/listinfo/python-list

Re: Parsing HTML with JavaScript

Reply via email to