When I had this problem, I found out that the characters that I'm entering were in UTF-8 format and java converts numbers to a cp1252 encoding. I took care of this using xml.getBytes("UTF-8") for writing and similarly new String(buffer,0,bytes_read,"UTF8") for reading. This solved my problem.
seeta -----Original Message----- From: David denBoer [mailto:[EMAIL PROTECTED] Sent: Thursday, March 02, 2006 4:14 PM To: java-user@lucene.apache.org Subject: Accented characters problem Hi all, We are havign a small problem searching for text with accents in the query. Our index has a word like 'agréé', and when we search for it, we get no results. The query parses (using Snowball) to : 'name:"agr\213 \213"' Using the ISOLatin filter, we get : 'name:agra' neither gets any results. When I perform the search using Luke, I get the expected results. Is there something I am not doing right? I swear this worked with Lucene 1.4.3 and is not working anymore, but it has been a while... Thanks, David. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]