Package: urlscan
Version: 0.5.6
I get the following traceback for the attached mail:
47980:[EMAIL PROTECTED]: ~] urlscan d
Traceback (most recent call last):
  File "/usr/bin/urlscan", line 81, in ?
    main(msg)
  File "/usr/bin/urlscan", line 73, in main
    background = options.background)
  File "/usr/lib/python2.4/site-packages/urlscan/urlchoose.py", line 37, in __init__
    for group, usedfirst, usedlast in extractedurls:
  File "/usr/bin/urlscan", line 56, in msgurls
    for chunk in msgurls(part, urlidx):
  File "/usr/bin/urlscan", line 64, in msgurls
    for chunk in urlscan.extracthtmlurls(msg.get_payload(decode = True)):
  File "/usr/lib/python2.4/site-packages/urlscan/urlscan.py", line 351, in extracthtmlurls
    c.feed(s)
  File "/usr/lib/python2.4/HTMLParser.py", line 108, in feed
    self.goahead(0)
  File "/usr/lib/python2.4/HTMLParser.py", line 171, in goahead
    self.handle_charref(name)
  File "/usr/lib/python2.4/site-packages/urlscan/urlscan.py", line 202, in handle_charref
    n = int(name)
ValueError: invalid literal for int(): xf6
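
The value xf6 in the error suggests that handle_charref was given a
hexadecimal character reference (&#xf6; is U+00F6, "ö"), which int(name)
cannot parse as decimal. A minimal sketch of one possible way to handle
both forms follows. It is not urlscan's actual code: the function names
charref_to_codepoint and charref_to_text are hypothetical, and it is
written for Python 3 (on Python 2.4, unichr would take the place of chr).

```python
def charref_to_codepoint(name):
    """Return the code point for the text between '&#' and ';'.

    'name' is what HTMLParser passes to handle_charref, for example
    '246' for a decimal reference or 'xf6' for a hexadecimal one.
    Raises ValueError if the digits are not valid.
    """
    if name[:1] in ("x", "X"):
        # Hexadecimal reference: drop the leading 'x' and parse base 16.
        return int(name[1:], 16)
    return int(name, 10)


def charref_to_text(name):
    """Return the referenced character, or the raw reference if invalid."""
    try:
        return chr(charref_to_codepoint(name))
    except (ValueError, OverflowError):
        # Hypothetical fallback: keep malformed references as literal text
        # instead of aborting the whole URL extraction.
        return "&#%s;" % name
```

With this sketch, charref_to_text("xf6") and charref_to_text("246") both
give "ö", and a malformed reference such as "xzz" is passed through
unchanged rather than raising.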
--
Martin Michlmayr
http://www.cyrius.com/
From: x
To: Martin Michlmayr <[EMAIL PROTECTED]>
Subject: x
Mime-Version: 1.0
Content-Type: multipart/alternative;
boundary="----=_Part_1386762_655054341.1177393971856"
Status: RO
Content-Length: 1604
Lines: 81

------=_Part_1386762_655054341.1177393971856
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

------=_Part_1386762_655054341.1177393971856
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: 7bit

<a href="http://www.linkedin.com/">View invitation from Michael Kröll</a>
------=_Part_1386762_655054341.1177393971856--