On Sat, 2 Feb 2002, Bill Barker wrote: > > + 4416 URI En/Decoding not working > > + (investigate and fix if feasible) > My vote is for LATER, since as I understand the bug it is too late to test > this well, and the fix (if not done right) has the potential to create > security problems. The fix is to basically flip UEncoder on it's head, and > work with "un-safe chars" instead of "safe chars" (as well as to add the > logic to use the encoding). If Costin (since it's his baby) thinks he's up > to it, by all means go for it. I just don't want to delay the release for > the amount of time it would take me to make and be comfortable with the fix > (esp. since there is a work-around already).
I'm not sure I understand - the bug seems to be about DecodeInterceptor using 8859_1 for decoding, even if a different decoding was found. I don't think it is touching UEncoder and the url encoding/decoding. The url decoding has nothing to do with the charset - we decode %xx as bytes, the url encoding happens after char->byte and decoding happen before byte->char conversions ( i.e. uencoding operates on bytes ). It is possible we have a bug - and a test case would help finding it. The code is quite tricky ( I spent huge amounts of time with charset/encoding issues ), and I agree LATER is good given the risks. But if I have the test case, I can take a look, it may be a simple fix. The way it is supposed to work - first the bytes are url decoded, then we detect the charset, then convert bytes to chars. Am I missing something here ? Costin -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>