This is obviously a bigger mess than I first thought. As I see it, the 
following options exist for resolving bug 22666.

1. WONTFIX - On the basis that there is too much uncertainty to do anything 
sensible and that any changes made might break interoperability as per Remy's 
point 3 below.

2. FIX - Patch the parameter class (as per Remy's point 2 below) on the grounds 
that the JSP spec states "The World Wide Web Consortium (http://www.w3.org/) is 
a definitive source of HTTP related information affecting this specification 
and its implementations." and the w3c view 
(http://www.w3.org/International/O-URL-code.html) is that URI encoding should 
always be based on UTF-8. However, this is still likely to break things (back 
to Remy's point 3).

3. FIX - Add a configuration option that enables w3c compliant URI decoding and 
patch the parameter and any other relevant classes to support this option. I am 
not 100% sure where the best place to do this would be. I am leaning towards 
adding it to the context as an optional parameter with a default state of 
disabled.

There are several bugs in bugzilla that look as if they are on similar lines 
and on that basis my own view is that option 3 is way to go. Before I start 
coding, I would be grateful for some feedback/guidance on my planned approach.

Thanks in anticipation.

Mark


On Friday, September 05, 2003 8:13 PM, Remy Maucherat [SMTP:[EMAIL PROTECTED] 
wrote:
> Mark Thomas wrote:
> > I was working from
> >
> > http://www.w3.org/International/O-URL-code.html
> >
> > Applying the patch fixed the problem as reported in bug 22666. I am happy 
to
> >
> > have another look at this. Can you point me in the direction of a better
> > reference?
>
> Well, -1 because:
> 1) Everyone ignores this standard
> 2) Your encoding will apply to *all* parameters, not just URL
> parameters; you have to patch the Parameters class for your patch to be
> correct (I would still vote -1, but at least it wouldn't break the
> specification)
> 3) It is extremely likely that people expect all parameters to have the
> same encoding, regardless of what that w3c spec says; if the servlet
> spec writes in big bold somewhere that the URL is always UTF8 (which
> would likely break interoperability with a lot of HTTP clients -
> possibly all), then it's different
>
> Remy
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
>


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to