Awesome, thank you.  So as far as escaping user-entered data for use
in URLs...urlencode is best?

On Sep 4, 8:45 pm, Sam Lai <samuel....@gmail.com> wrote:
> Here's the code for the two (the numbers at the start of each line are
> just line numbers from the file) -
>
> iriencode:
> 128     """
> 129     Convert an Internationalized Resource Identifier (IRI) portion to a 
> URI
> 130     portion that is suitable for inclusion in a URL.
> 131
> 132     This is the algorithm from section 3.1 of RFC 3987.  However,
> since we are
> 133     assuming input is either UTF-8 or unicode already, we can
> simplify things a
> 134     little from the full method.
> 135
> 136     Returns an ASCII string containing the encoded result.
> 137     """
> 138     # The list of safe characters here is constructed from the
> "reserved" and
> 139     # "unreserved" characters specified in sections 2.2 and 2.3 of
> RFC 3986:
> 140     #     reserved    = gen-delims / sub-delims
> 141     #     gen-delims  = ":" / "/" / "?" / "#" / "[" / "]" / "@"
> 142     #     sub-delims  = "!" / "$" / "&" / "'" / "(" / ")"
> 143     #                   / "*" / "+" / "," / ";" / "="
> 144     #     unreserved  = ALPHA / DIGIT / "-" / "." / "_" / "~"
> 145     # Of the unreserved characters, urllib.quote already considers all but
> 146     # the ~ safe.
> 147     # The % character is also added to the list of safe characters
> here, as the
> 148     # end of section 3.1 of RFC 3987 specifically mentions that %
> must not be
> 149     # converted.
> 150     if iri is None:
> 151         return iri
> 152     return urllib.quote(smart_str(iri), safe="/#%[]=:;$&()+,!?*@'~")
>
> urlencode:
> 11     """
> 12     A version of Python's urllib.quote() function that can operate
> on unicode
> 13     strings. The url is first UTF-8 encoded before quoting. The
> returned string
> 14     can safely be used as part of an argument to a subsequent
> iri_to_uri() call
> 15     without double-quoting occurring.
> 16     """
> 17     return force_unicode(urllib.quote(smart_str(url), safe='/'))
>
> So iriencode only encodes the IRI portion (hence the longer list of
> safe characters), while URL will encode the entire URL, including any
> GET arguments and anchors.
>
> As for usage, I haven't encountered any IRIs, but I believe IRIs need
> to be encoded before inclusion in HTML (i.e. you can't just include
> the non-ASCII characters in HTML). As for urlencode, its main purpose
> is if you're including a URL in a form submission, e.g. the URL to go
> to after login. urlencode will do everything that iriencode does, but
> sometimes you might not want it to do that.
>
> On 5 September 2010 08:17, Jordon Wii <jordon...@gmail.com> wrote:
>
>
>
>
>
>
>
> > Anyone?  I haven't found anything that describes the difference
> > (except that one is for URI's and the other for URLs).
>
> > On Sep 4, 8:52 am, Jordon Wii <jordon...@gmail.com> wrote:
> >> What's the difference between the template filters urlencode and
> >> iriencode?  When should I use one over the other (or use both)?
>
> > --
> > You received this message because you are subscribed to the Google Groups 
> > "Django users" group.
> > To post to this group, send email to django-us...@googlegroups.com.
> > To unsubscribe from this group, send email to 
> > django-users+unsubscr...@googlegroups.com.
> > For more options, visit this group 
> > athttp://groups.google.com/group/django-users?hl=en.

-- 
You received this message because you are subscribed to the Google Groups 
"Django users" group.
To post to this group, send email to django-us...@googlegroups.com.
To unsubscribe from this group, send email to 
django-users+unsubscr...@googlegroups.com.
For more options, visit this group at 
http://groups.google.com/group/django-users?hl=en.

Reply via email to