On Mon, May 9, 2011 at 2:03 PM, Konstantin Kolinko
<knst.koli...@gmail.com> wrote:
<..>
> If ";" is part of the actual path, it must be escaped.
>
> If ";" starts a "path parameter" it must be unescaped. One well-known
> example is ";jsessionid" path parameter.

Thanks for your answer. Is this just a "de facto" rule, or is it
documented anywhere in RFC 3986/RFC 2396?

Extending my question, are there clear criteria that define which
characters always need escaping and which don't? At the moment I am
escaping everything that is not unreserved [1], but I am not sure
about SEO and user-friendliness - this especially concerns paths
with international characters in URLs, e.g. http://site/pathąčęė

I have also found a similar Tomcat bug [2], but it addresses a
slightly different issue.

If anyone wants to benefit from this, I have just added 50 bonus
points to my SO question [3]. The main question I want answered is -
which characters can be escaped and which must be escaped, both in
terms of the RFC and Tomcat.

Regards,
Mindaugas

1. According to RFC 3986, unreserved = ALPHA / DIGIT / "-" / "." / "_" / "~"
2. https://issues.apache.org/bugzilla/show_bug.cgi?id=51132
3. http://stackoverflow.com/questions/5913623/rfc3986-which-pchars-need-to-be-percent-encoded

