Il Sat, 10 Jul 2010 16:24:23 +, mattia ha scritto:
> Hi all, I'm using py3k and the urllib package to download web pages. Can
> you suggest me a package that can translate reserved characters in html
> like "è", "ò", "é" in the corresponding correct
> encoding?
>
> Thanks,
> Mattia
Basically
On 7/10/2010 2:03 PM, mattia wrote:
Il Sat, 10 Jul 2010 18:09:12 +0100, MRAB ha scritto:
mattia wrote:
Hi all, I'm using py3k and the urllib package to download web pages.
Can you suggest me a package that can translate reserved characters in
html like "è", "ò", "é" in the corresponding
correc
> Hi all, I'm using py3k and the urllib package to download web pages. Can
> you suggest me a package that can translate reserved characters in html
> like "è", "ò", "é" in the corresponding correct
> encoding?
I think the html parser of LXML can convert the entities, too.
Christian
--
http:
Il Sat, 10 Jul 2010 18:09:12 +0100, MRAB ha scritto:
> mattia wrote:
>> Hi all, I'm using py3k and the urllib package to download web pages.
>> Can you suggest me a package that can translate reserved characters in
>> html like "è", "ò", "é" in the corresponding
>> correct encoding?
>>
> import r
mattia wrote:
Hi all, I'm using py3k and the urllib package to download web pages. Can
you suggest me a package that can translate reserved characters in html
like "è", "ò", "é" in the corresponding correct
encoding?
import re
from html.entities import entitydefs
# The downloaded web page w
On Jul 10, 2010, at 09:24 , mattia wrote:
> Hi all, I'm using py3k and the urllib package to download web pages. Can
> you suggest me a package that can translate reserved characters in html
> like "è", "ò", "é" in the corresponding correct
> encoding?
It won't do the whole job for you but you
Hi all, I'm using py3k and the urllib package to download web pages. Can
you suggest me a package that can translate reserved characters in html
like "è", "ò", "é" in the corresponding correct
encoding?
Thanks,
Mattia
--
http://mail.python.org/mailman/listinfo/python-list