Re: HTMLParser not parsing whole html file

2010-10-26 Thread John Nagle
On 10/24/2010 11:44 PM, Stefan Behnel wrote: josh logan, 25.10.2010 04:14: I found the error. The HTML file I'm parsing has invalid HTML at line 193. It has something like: Note there is no space between the closing quote for the "href" tag and the class attribute. I guess I'll go through eac

Re: HTMLParser not parsing whole html file

2010-10-24 Thread Stefan Behnel
josh logan, 25.10.2010 04:14: I found the error. The HTML file I'm parsing has invalid HTML at line 193. It has something like: Note there is no space between the closing quote for the "href" tag and the class attribute. I guess I'll go through each file and correct these issues as I parse the

Re: HTMLParser not parsing whole html file

2010-10-24 Thread josh logan
On Oct 24, 4:38 pm, josh logan wrote: > On Oct 24, 4:36 pm, josh logan wrote: > > > > > > > Hello, > > > I wanted to use python to scrub an html file for score data, but I'm > > having trouble. > > I'm using HTMLParser, and the parsing seems to fizzle out around line > > 192 or so. None of the ev

Re: HTMLParser not parsing whole html file

2010-10-24 Thread josh logan
On Oct 24, 4:36 pm, josh logan wrote: > Hello, > > I wanted to use python to scrub an html file for score data, but I'm > having trouble. > I'm using HTMLParser, and the parsing seems to fizzle out around line > 192 or so. None of the event functions are being called anymore > (handle_starttag, ha