On Fri, 2006-01-20 at 00:25 +0100, Francesco Potortì wrote:
> > This has been tracked down to a link that has non-ascii characters
> > in it.
> 
> I cannot find it.  Could you please point me to that?

They are all mail? or mailto scheme urls on pages
  http://pot.potorti.it/
  http://www.potorti.it/

Also note that pages from http://pot.potorti.it/ do not have an encoding
specified which makes them only render correctly if the use uses the
same character set as used on the website.

> > I have fixed webcheck to firstly not fail on non-ascii characters
> > any more when generating a report and secondly to escape non-ascii
> > characters when parsing html.
> 
> Shouldn't it signal this as a problem in the report?

Maybe, but I'm not sure. I don't see any real problems in using
non-ascii characters in mailto urls but I think this shouldn't be done
for http or ftp urls. But to fully support this in webcheck urls will
have to be stored as unicode strings (which will probably affect a lot
of portions of the code).

-- 
-- arthur - [EMAIL PROTECTED] - http://people.debian.org/~adejong --

Attachment: signature.asc
Description: This is a digitally signed message part

Reply via email to