[web2py:12555] Re: non-ascii chars URL

achipa Thu, 27 Nov 2008 05:20:14 -0800

You can't really rely on a perlish 'do-what-I-mean' behaviour as it
gets worse - domain names (as part of the URI) can also contain
unicode in a *different* encoding, punycode. For example 
http://xn--tdali-d8a8w.lv/
that includes the Unicode representation of the Latvian "u with a
macron", and "n with cedilla", instead of the unmarked base
characters: http://tūdaliņ.lv - if you paste this 'funny char' version
into firefox, it will actually convert it to punycode and load the
page, although it does not display the unicode chars.


On Nov 27, 1:36 pm, "Daniel Contag" <[EMAIL PROTECTED]> wrote:
> My opinion is that we should adhere to the RFCs.
>
> Daniel
>
> On Thu, Nov 27, 2008 at 13:12, Jonathan Benn <[EMAIL PROTECTED]> wrote:
>
> > Hi Massimo,
>
> > On Nov 27, 6:13 am, mdipierro <[EMAIL PROTECTED]> wrote:
>
> >> Does anybody have any objection to this patch?
>
> > I object, on the grounds that the URL standard does *not* support
> > unicode. As Achipa points out, a unicode URL needs to be escaped in
> > order to pass validation.
>
> > If we want to support unicode URLs, rather than breaking web2py's
> > standards-compliance, I suggest either extending IS_URL with a new
> > "mode" that automatically escapes unicode, or creating a utility
> > function that escapes unicode URLs (but leaves non-Unicode URLs
> > intact).
>
> > What does everyone else think?
>
> > I hope that helps,
>
> > --Jonathan
--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"web2py Web Framework" group.
To post to this group, send email to web2py@googlegroups.com
To unsubscribe from this group, send email to [EMAIL PROTECTED]
For more options, visit this group at 
http://groups.google.com/group/web2py?hl=en
-~----------~----~----~----~------~----~------~--~---

[web2py:12555] Re: non-ascii chars URL

Reply via email to