Re: HTMLParser not parsing whole html file

2010-10-26 Thread John Nagle
On 10/24/2010 11:44 PM, Stefan Behnel wrote: josh logan, 25.10.2010 04:14: I found the error. The HTML file I'm parsing has invalid HTML at line 193. It has something like: Note there is no space between the closing quote for the "href" tag and the class attribute. I guess I'll go through eac

Re: HTMLParser not parsing whole html file

2010-10-24 Thread Stefan Behnel
josh logan, 25.10.2010 04:14: I found the error. The HTML file I'm parsing has invalid HTML at line 193. It has something like: Note there is no space between the closing quote for the "href" tag and the class attribute. I guess I'll go through each file and correct these issues as I parse the

Re: HTMLParser not parsing whole html file

2010-10-24 Thread josh logan
On Oct 24, 4:38 pm, josh logan wrote: > On Oct 24, 4:36 pm, josh logan wrote: > > > > > > > Hello, > > > I wanted to use python to scrub an html file for score data, but I'm > > having trouble. > > I'm using HTMLParser, and the parsing seems to fizzle out around line > > 192 or so. None of the ev

Re: HTMLParser not parsing whole html file

2010-10-24 Thread josh logan
On Oct 24, 4:36 pm, josh logan wrote: > Hello, > > I wanted to use python to scrub an html file for score data, but I'm > having trouble. > I'm using HTMLParser, and the parsing seems to fizzle out around line > 192 or so. None of the event functions are being called anymore > (handle_starttag, ha

HTMLParser not parsing whole html file

2010-10-24 Thread josh logan
Hello, I wanted to use python to scrub an html file for score data, but I'm having trouble. I'm using HTMLParser, and the parsing seems to fizzle out around line 192 or so. None of the event functions are being called anymore (handle_starttag, handle_endtag, etc.) and I don't understand why, becau