<[EMAIL PROTECTED]> wrote in message news:[EMAIL PROTECTED]
> I am trying to extract some information from a few web pages, and I was
> using the HTMLParser module. It worked fine until it got to the
> javascript, at which point it gave a parse error.

It's fairly common for pages with JavaScript to also be invalid HTML. HTMLParser isn't an "ignore all errors silently and guess what it's meant to be" parser. Unless you have known-good inputs, it's often best to use an alternative. Some options are discussed in Uche's article here:

http://www.xml.com/pub/a/2004/09/08/pyxml.html
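
For anyone reading this on a current Python: the strict HTMLParser module discussed above is the Python 2 one. Its Python 3 successor, html.parser, is documented as tolerating invalid markup and treats the body of a script element as plain data, so a rough extraction pass may be enough for simple pages. The following is only a sketch under that assumption; the names TextExtractor and extract_text are made up for illustration and do not come from the original thread or from Uche's article.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect visible text, ignoring the bodies of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.chunks = []       # stripped text fragments, in document order
        self._skip_depth = 0   # greater than 0 while inside <script>/<style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style elements.
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())


def extract_text(html):
    """Hypothetical helper: return the list of text fragments in html."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.chunks


if __name__ == "__main__":
    page = (
        "<html><body><p>Hello</p>"
        '<script>if (a < b) { x = "<b>"; }</script>'
        "<p>World</p></body></html>"
    )
    print(extract_text(page))
```

This does not make a broken page valid; it only avoids stopping at the first error. For seriously malformed input, a dedicated tolerant parser of the kind the article surveys is still the safer choice.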