In <[EMAIL PROTECTED]>, none wrote: > Gabriel Genellina wrote: >> En Mon, 18 Jun 2007 16:38:18 -0300, Sergio Monteiro Basto >> <[EMAIL PROTECTED]> escribió: >> >>> Can someone explain me, what is wrong with this site ? >>> >>> python linkExtractor3.py http://www.noticiasdeaveiro.pt > test >>> >>> HTMLParser.HTMLParseError: EOF in middle of construct, at line 1173, >>> column 1 >>> >>> at line 1173 of test file is perfectly normal . >> >> That page is not valid HTML - http://validator.w3.org/ finds 726 errors >> in it. > > ok but my problem is not understand what is the specific problem at line > 1173
You can't just look at that line and ignore the rest. There are 604 (!) errors, some about table rows, before this line. So the parser may be confused at this point and be already in an internal state that sees that line in a completely different light than you do. Ciao, Marc 'BlackJack' Rintsch -- http://mail.python.org/mailman/listinfo/python-list