use "TokenStream" to print out the Tokens after using FrenchAnalyzer; test it work well, and the result is just you want?
You can use another tools "Luke" to see the "Lucene index File", is there any token of "or"(French) ? And you can query using the "Luke"... Good Luck 2011-01-13 --------------------------------- Yinjie Hong Ph. D. Student College of Computer Science, Zhejiang University Tel: 86-571-87952026 E-Mail: hongyj(at)zju(dot)edu(dot)cn Office: Room 400, Teaching Building #11, Yuquan Campus, Zhejiang University Thank you for your reply. I am using my own FrenchAnalyzer for lexical analysis. It extends org.apache.lucene.analysis.Analyzer and my stopwords set is empty. Benoit On 2011-01-12 23:05, Robert Muir wrote: > On Wed, Jan 12, 2011 at 10:38 PM, Benoit Mercier > <benoit.merc...@member.fsf.org> wrote: >> Hi, >> >> I am happily using Lucene for several years to offer French lexical analysis >> tools to university researchers. Today, one of them decided to analyze the >> use of the French word "or" (meaning "gold" in French) in one of my corpus >> powered by Lucene... And, as you probably already guessed, no results... >> > What analyzer are you using? > > By default, StandardAnalyzer and StopAnalyzer uses a set of english > stopwords. For french, this list is probably not appropriate. > If you look at the javadocs, you can pass in your own set of > stopwords... for lexical analysis maybe this should be an empty set. > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org