tags 731931 |fixed-upstream
thanks

|On 2013-12-11 12:15, Alexis Huxley wrote:
With jessie's urlwatch (and prerequisite python-concurrent.futures):

torchio$ egrep -v '^($|#)' ~/.urlwatch/urls.txt
http://www.kayser-threde.de/en/jobs/
torchio$ urlwatch
Traceback (most recent call last):
   File "/usr/bin/urlwatch", line 260, in <module>
     raise exception
LookupError: unknown encoding: 0
torchio$

This is due to the server sending an invalid charset in the header:

% HEAD http://www.kayser-threde.de/en/jobs/
200 OK
Connection: close
Date: Wed, 29 Jan 2014 19:06:43 GMT
Server: Apache/2.2.13 (Linux/SUSE)
Content-Type: text/html; charset=0
X-Powered-By: PHP/5.3.3

(see the Content-Type header)

I've now added a workaround upstream that tries to decode the web page content with the "ascii" codec in case it fails to lookup the encoding name:

https://github.com/thp/urlwatch/commit/305f630b4a


Thanks,
Thomas


--
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org

Reply via email to