En Tue, 22 Jan 2008 15:35:16 -0200, Boris <[EMAIL PROTECTED]> escribió:
> I'm using debian linux, Python 2.4.4, and utidylib (http:// > utidylib.berlios.de/). I wrote simple functions to get a web page, > convert it from windows-1251 to utf8 and then I'd like to clean html > with it. Why the intermediate conversion? I don't know utidylib, but can't you feed it with the original page, in the original encoding? If the page itself contains a "meta http-equiv" tag stating its content-type and charset, it won't be valid anymore if you reencode the page. -- Gabriel Genellina -- http://mail.python.org/mailman/listinfo/python-list