When I had this problem, I found out that the characters that I'm entering were 
in UTF-8 format and java converts numbers to a cp1252 encoding. I took care of 
this using xml.getBytes("UTF-8") for writing and similarly   new 
String(buffer,0,bytes_read,"UTF8") for reading. This solved my problem.

seeta

-----Original Message-----
From: David denBoer [mailto:[EMAIL PROTECTED] 
Sent: Thursday, March 02, 2006 4:14 PM
To: java-user@lucene.apache.org
Subject: Accented characters problem

Hi all,

We are havign a small problem searching for text with accents in the  
query. Our index has a word like 'agréé', and when we search for it,  
we get no results.

The query parses (using Snowball) to :
'name:"agr\213 \213"'

Using the ISOLatin filter, we get :
'name:agra'

neither gets any results.

When I perform the search using Luke, I get the expected results.

Is there something I am not doing right? I swear this worked with  
Lucene 1.4.3 and is not working anymore, but it has been a while...

Thanks,
David.
  
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to