Yeah, the error is thrown by HTMLParser, TAG is build on top of it.
I will try some other tools like Beautifull Soup.

Thanks.


2014-05-20 10:04 GMT-03:00 Anthony <abasta...@gmail.com>:

> No, TAG is only a basic parser and not robust against errors in the HTML.
> You should probably use a more sophisticated tool, such as Beautiful Soup
> (which is built on top of the lxml and html5lib parsers). The standard
> library also includes the HTMLParser module, but you may run into similar
> problems with that.
>
> Anthony
>
>
> On Tuesday, May 20, 2014 8:14:37 AM UTC-4, yamandu wrote:
>>
>> I am trying to parse a HTML with the TAG helper from a fetched URL using
>> urllib.
>> The HTML is broken in some parts, it has end span tags without respective
>> start span tags.
>>
>> TAG helper gives error: unable to balance span tag.
>>
>> I tested it. Open tags not closed are parsed, but not closed tags without
>> open.
>>
>> Would be there a work around for this?
>>
>  --
> Resources:
> - http://web2py.com
> - http://web2py.com/book (Documentation)
> - http://github.com/web2py/web2py (Source code)
> - https://code.google.com/p/web2py/issues/list (Report Issues)
> ---
> You received this message because you are subscribed to the Google Groups
> "web2py-users" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to web2py+unsubscr...@googlegroups.com.
> For more options, visit https://groups.google.com/d/optout.
>



-- 
Att.

Carlos J. Costa
Cientista da Computação
Esp. Gestão em Telecom

EL MELECH NEEMAN!
אָמֵן

-- 
Resources:
- http://web2py.com
- http://web2py.com/book (Documentation)
- http://github.com/web2py/web2py (Source code)
- https://code.google.com/p/web2py/issues/list (Report Issues)
--- 
You received this message because you are subscribed to the Google Groups 
"web2py-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to web2py+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to