Storing special characters in Lucene

2008-08-21 Thread Juan Pablo Morales
" Looking at the index with Luke it shows me "ni�os" but when I want to see the full text (by right clicking) it shows me ni�os. I know Lucene is supposed to store fields in UTF8, but then, how can I make sure I sotre something and get it back just as it was, including special chara

Re: Storing special characters in Lucene

2008-08-21 Thread Juan Pablo Morales
On Thu, Aug 21, 2008 at 12:47 PM, Steven A Rowe <[EMAIL PROTECTED]> wrote: > Hola Juan, Hi Steve > > > On 08/21/2008 at 1:16 PM, Juan Pablo Morales wrote: > > I have an index in Spanish and I use Snowball to stem and > > analyze and it works perfectly. However,

Re: Storing special characters in Lucene

2008-08-21 Thread Juan Pablo Morales
t;Document theDoc = hits.doc(0); >String nombre = theDoc.get("name"); >System.out.println("Nombre: " + nombre); > } > } > > > When I run this in IntelliJ, I get: > > Query: name:niñ > Nombre: niños > > Process finished with exit code 0 > > > Are you by chance indexing XML? Indirectly, yes > > -- Juan Pablo Morales Ingenian Software ltda

Re: Storing special characters in Lucene

2008-08-21 Thread Juan Pablo Morales
It was, after all an XML issue, the servlets creating the content that was being indexed were not sending UTF but the XML declaration stated the code WAS UTF, so it really was not a Lucene issue after all. Thanks for all the help. On Thu, Aug 21, 2008 at 6:18 PM, Juan Pablo Morales <[EM