Careful: in the HTTP world there's an ambiguity. application/x-www-form-urlencoded does not specify the character encoding in which the bytes represented by the %-escaped sequences are written.
That's addressed by the very recent URI spec, where the absence of a declared encoding means UTF-8...

My experience was that Tomcat simply converted each of these bytes into the low byte of a 16-bit Unicode char, which amounts to decoding them as ISO-8859-1. We succeeded in receiving forms from UTF-8-encoded pages by wrapping an InputStreamReader in UTF-8 around an InputStream that reads the bytes of the string returned by request.getParam...
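
A minimal sketch of that workaround, in case it helps. The class and method names (FormParamRecoder, latin1ToUtf8) are made up for illustration, and it assumes the container really did decode the form bytes as ISO-8859-1, so that each char of the string holds exactly one original byte. It also uses StandardCharsets, which needs a more recent JDK than we had at the time.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.Reader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of Tomcat or the servlet API.
final class FormParamRecoder {

    // Takes a string whose chars each carry one raw byte (the result of an
    // ISO-8859-1 decode) and reads those bytes again as UTF-8.
    static String latin1ToUtf8(String misdecoded) {
        // ISO-8859-1 maps chars 0..255 to the same byte values, so this
        // recovers the bytes that were originally sent by the browser.
        byte[] raw = misdecoded.getBytes(StandardCharsets.ISO_8859_1);
        StringBuilder out = new StringBuilder();
        try (Reader reader = new InputStreamReader(
                new ByteArrayInputStream(raw), StandardCharsets.UTF_8)) {
            int c;
            while ((c = reader.read()) != -1) {
                out.append((char) c);
            }
        } catch (IOException e) {
            // Not expected for an in-memory stream.
            throw new UncheckedIOException(e);
        }
        return out.toString();
    }
}
```

You would pass it the parameter string you got from the request. Plain ASCII values come back unchanged, so it is safe for those, but a value that was already correctly decoded and contains non-ASCII characters would be damaged by a second pass.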

Hope that helps.

paul



On 1 Jul 05, at 22:41, <[EMAIL PROTECTED]> wrote:


Did you check that the request string you get at the analyzer
level is correctly encoded as UTF-8?
We had the same problem with French accented characters, also
encoded as UTF-8 and transmitted by Tomcat as ISO-8859-1. It was
just for a test, so we didn't investigate much, but we
re-encoded the string as URL/ISO-8859-1 and re-decoded it from
URL as proper UTF-8, and it worked.
I don't know if it may help you ...
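
The round trip described above might look roughly like the following. This is only a sketch under assumptions: the class and method names (UrlRoundTrip, reencode) are invented for illustration, and the input is assumed to be a string that was wrongly decoded as ISO-8859-1.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLDecoder;
import java.net.URLEncoder;

// Hypothetical helper illustrating the URL re-encode / re-decode trick.
final class UrlRoundTrip {

    static String reencode(String misdecoded) {
        try {
            // Step 1: percent-encode using ISO-8859-1, which turns each
            // char back into the %XX form of the byte originally sent.
            String escaped = URLEncoder.encode(misdecoded, "ISO-8859-1");
            // Step 2: decode those same %XX bytes, this time as UTF-8.
            return URLDecoder.decode(escaped, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            // Both charsets are required on every Java platform.
            throw new IllegalStateException(e);
        }
    }
}
```

URLEncoder turns spaces into '+' and URLDecoder turns them back, so spaces survive the round trip.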


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


