Ok, the problem had nothing to do with the analyzers. What happened
was that the search query text was the input of a web page form, and I
hadn't set that form's method to "post", so the query was sent in the
address bar; that messed up the query's encoding.

So using BrazilianAnalyzer does work for indexing and search words
with accents...

Thanks for the help and best regards,
Eduardo Cordeiro

2006/8/3, Leandro Saad <[EMAIL PROTECTED]>:
I'm using StandardAnalyser all over, so, yes, portuguese stopwords won't be
eliminated

--
Leandro Rodrigo Saad Cruz
CTO - InterBusiness Technologies
db.apache.org/ojb
guara-framework.sf.net
xingu.sf.net

On 8/2/06, Eduardo S. Cordeiro <[EMAIL PROTECTED]> wrote:
>
> But was your index created with BrazilianAnalyzer? Because otherwise
> you wouldn't have portuguese stopwords eliminated, like "e", "ou",
> etc.
>
> 2006/8/2, Leandro Saad <[EMAIL PROTECTED]>:
> > Hi Eduardo. I'm using the StandardAnalyser and I can search for words
> with
> > accents. In my case "saúde"
> >
> > --
> > Leandro Rodrigo Saad Cruz
> > CTO - InterBusiness Technologies
> > db.apache.org/ojb
> > guara-framework.sf.net
> > xingu.sf.net
> >
> > On 8/1/06, Eduardo S. Cordeiro <[EMAIL PROTECTED]> wrote:
> > >
> > > Yes...here's how I create my QueryParser:
> > >
> > > QueryParser parser = new QueryParser("text", new BrazilianAnalyzer());
> > >
> > > 2006/8/1, Zhang, Lisheng <[EMAIL PROTECTED]>:
> > > > Hi,
> > > >
> > > > Have you used the same BrazilianAnalyzer when
> > > > searching?
> > > >
> > > > Best regards, Lisheng
> > > >
> > > > -----Original Message-----
> > > > From: Eduardo S. Cordeiro [mailto:[EMAIL PROTECTED]
> > > > Sent: Tuesday, August 01, 2006 1:40 PM
> > > > To: java-user@lucene.apache.org
> > > > Subject: Search with accents
> > > >
> > > >
> > > > Hello there,
> > > >
> > > > I have a brazilian portuguese index, which has been analyzed with
> > > > BrazilianAnalyzer. When searching words with accents, however,
> they're
> > > > not found -- for instance, if the index contains some text with the
> > > > word "maçã" and I search for that very word, I get no hits, but if I
> > > > search "maca" (which is another portuguese word) then the document
> > > > containing "maçã" is found.
> > > >
> > > > I've seen posts in the archive indicating that I should use
> > > > ISOLatin1AccentFilter to handle this, but I don't quite see how:
> > > > should I leave indexation as it is and use this filter only for
> search
> > > > queries or should I apply it in both cases?
> > > >
> > > > Thank you,
> > > > Eduardo Cordeiro
> > > >
> > > >
> ---------------------------------------------------------------------
> > > > To unsubscribe, e-mail: [EMAIL PROTECTED]
> > > > For additional commands, e-mail: [EMAIL PROTECTED]
> > > >
> > > >
> > >
> >
> >
>


Reply via email to