When I had this problem, I found that the characters I was entering were
in UTF-8 while Java was converting them to the cp1252 encoding. I fixed this
by using xml.getBytes("UTF-8") for writing and, similarly, new
String(buffer, 0, bytes_read, "UTF-8") for reading. This solved my problem.
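A minimal sketch of that round trip, in case it helps. The class and method names here are made up for illustration, and it uses StandardCharsets (Java 7 and later) in place of the "UTF-8" string name, which should behave the same way. The last step decodes the same bytes as windows-1252 (cp1252) to show the kind of mismatch described above.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of Lucene or of the original code.
final class Utf8RoundTrip {

    // Writing side: always encode with an explicit charset instead of
    // relying on the platform default (which may be cp1252).
    static byte[] encode(String text) {
        return text.getBytes(StandardCharsets.UTF_8);
    }

    // Reading side: decode only the bytes actually read, as UTF-8.
    static String decode(byte[] buffer, int bytesRead) {
        return new String(buffer, 0, bytesRead, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String word = "agr\u00e9\u00e9"; // "agréé", written with escapes

        byte[] utf8 = encode(word);
        // Each 'é' takes two bytes in UTF-8, so 3 + 2 * 2 = 7 bytes.
        System.out.println("UTF-8 byte count: " + utf8.length);

        String roundTrip = decode(utf8, utf8.length);
        System.out.println("Round trip matches: " + roundTrip.equals(word));

        // Decoding the same bytes with the wrong charset yields a
        // different string, so a search for the original word fails.
        String wrong = new String(utf8, Charset.forName("windows-1252"));
        System.out.println("cp1252 decode matches: " + wrong.equals(word));
    }
}
```

The point is only that the same charset has to be named on both the writing and the reading side; which one the rest of your pipeline expects is something to check separately.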
seeta
-----Original Message-----
From: David denBoer [mailto:[EMAIL PROTECTED]
Sent: Thursday, March 02, 2006 4:14 PM
To: [email protected]
Subject: Accented characters problem
Hi all,
We are having a small problem searching for text with accents in the
query. Our index has a word like 'agréé', and when we search for it,
we get no results.
The query parses (using Snowball) to :
'name:"agr\213 \213"'
Using the ISOLatin filter, we get :
'name:agra'
Neither gets any results.
When I perform the search using Luke, I get the expected results.
Is there something I am not doing right? I swear this worked with
Lucene 1.4.3 and is not working anymore, but it has been a while...
Thanks,
David.
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]