OK, the problem had nothing to do with the analyzers. The search query came from a web page form, and I hadn't set that form's method to "post", so the browser submitted it with the default GET method and the query text travelled in the URL (it showed up in the address bar). That is where the query's encoding got mangled.
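
A minimal sketch of how a query term can get mangled on its way through the URL. It assumes the page submitted the term as UTF-8 while the receiving side decoded it as ISO-8859-1; the thread does not say which charsets were actually involved, and the class and method names here are made up for the illustration.

```java
import java.net.URLDecoder;
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class; not part of Lucene or of the original application.
class QueryEncodingDemo {

    // Percent-encode the term the way a UTF-8 page would put it in the URL (assumption).
    static String encodeAsUtf8(String term) {
        return URLEncoder.encode(term, StandardCharsets.UTF_8);
    }

    // Decode the URL parameter with the wrong charset (assumed: ISO-8859-1).
    static String decodeAsLatin1(String encoded) {
        return URLDecoder.decode(encoded, StandardCharsets.ISO_8859_1);
    }

    // Decode the URL parameter with the charset it was actually encoded in.
    static String decodeAsUtf8(String encoded) {
        return URLDecoder.decode(encoded, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String term = "ma\u00e7\u00e3"; // the word from the original question, written with escapes
        String encoded = encodeAsUtf8(term);

        System.out.println(encoded); // ma%C3%A7%C3%A3
        // Each two-byte UTF-8 sequence is read back as two separate Latin-1 characters.
        System.out.println(decodeAsLatin1(encoded).equals(term)); // false
        System.out.println(decodeAsUtf8(encoded).equals(term));   // true
    }
}
```

With a mismatch like this, the term handed to the QueryParser is no longer the word that was typed, which would explain an empty result regardless of the analyzer in use.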
So BrazilianAnalyzer does work for both indexing and searching words with accents.

Thanks for the help and best regards,
Eduardo Cordeiro

2006/8/3, Leandro Saad <[EMAIL PROTECTED]>:
I'm using StandardAnalyzer all over, so, yes, Portuguese stopwords won't be eliminated

--
Leandro Rodrigo Saad Cruz
CTO - InterBusiness Technologies
db.apache.org/ojb
guara-framework.sf.net
xingu.sf.net

On 8/2/06, Eduardo S. Cordeiro <[EMAIL PROTECTED]> wrote:
>
> But was your index created with BrazilianAnalyzer? Because otherwise
> you wouldn't have Portuguese stopwords eliminated, like "e", "ou",
> etc.
>
> 2006/8/2, Leandro Saad <[EMAIL PROTECTED]>:
> > Hi Eduardo. I'm using the StandardAnalyzer and I can search for words
> > with accents. In my case "saúde"
> >
> > --
> > Leandro Rodrigo Saad Cruz
> > CTO - InterBusiness Technologies
> > db.apache.org/ojb
> > guara-framework.sf.net
> > xingu.sf.net
> >
> > On 8/1/06, Eduardo S. Cordeiro <[EMAIL PROTECTED]> wrote:
> > >
> > > Yes...here's how I create my QueryParser:
> > >
> > > QueryParser parser = new QueryParser("text", new BrazilianAnalyzer());
> > >
> > > 2006/8/1, Zhang, Lisheng <[EMAIL PROTECTED]>:
> > > > Hi,
> > > >
> > > > Have you used the same BrazilianAnalyzer when
> > > > searching?
> > > >
> > > > Best regards, Lisheng
> > > >
> > > > -----Original Message-----
> > > > From: Eduardo S. Cordeiro [mailto:[EMAIL PROTECTED]
> > > > Sent: Tuesday, August 01, 2006 1:40 PM
> > > > To: java-user@lucene.apache.org
> > > > Subject: Search with accents
> > > >
> > > > Hello there,
> > > >
> > > > I have a Brazilian Portuguese index, which has been analyzed with
> > > > BrazilianAnalyzer. When searching words with accents, however,
> > > > they're not found -- for instance, if the index contains some text
> > > > with the word "maçã" and I search for that very word, I get no
> > > > hits, but if I search "maca" (which is another Portuguese word)
> > > > then the document containing "maçã" is found.
> > > >
> > > > I've seen posts in the archive indicating that I should use
> > > > ISOLatin1AccentFilter to handle this, but I don't quite see how:
> > > > should I leave indexing as it is and use this filter only for
> > > > search queries, or should I apply it in both cases?
> > > >
> > > > Thank you,
> > > > Eduardo Cordeiro
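
On the accent-filter question in the original message quoted above: a filter that rewrites tokens generally has to run both at index time and at query time, otherwise the two sides produce different terms. The sketch below shows that with plain Java. It uses java.text.Normalizer as a stand-in and is not Lucene's ISOLatin1AccentFilter; the class name, the fold helper and the set-based "index" are all hypothetical.

```java
import java.text.Normalizer;
import java.util.HashSet;
import java.util.Set;

// Hypothetical demo class; a stand-in for an accent-stripping token filter.
class AccentFoldingDemo {

    // Decompose accented characters (NFD) and drop the combining marks.
    static String fold(String term) {
        String decomposed = Normalizer.normalize(term, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}", "");
    }

    // Toy "index": the set of terms after folding, as an index-time filter would store them.
    static Set<String> indexTerms(String... terms) {
        Set<String> index = new HashSet<>();
        for (String term : terms) {
            index.add(fold(term));
        }
        return index;
    }

    public static void main(String[] args) {
        Set<String> index = indexTerms("ma\u00e7\u00e3", "sa\u00fade");
        String query = "ma\u00e7\u00e3";

        // Folding applied only at index time: the raw query term is not in the index.
        System.out.println(index.contains(query));       // false
        // Folding applied on both sides: the query term matches.
        System.out.println(index.contains(fold(query))); // true
    }
}
```

The same reasoning explains the symptom described in the thread: if accents are removed while indexing, a search for the unaccented form finds the document while a query carrying a different form of the word does not.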