Use a "TokenStream" to print the tokens that your FrenchAnalyzer produces. Does 
it work correctly, and is the output what you expect?

You can also use another tool, "Luke", to inspect the Lucene index files. Is 
there any token "or" (French) in the index?
You can also run queries from within Luke.
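
To illustrate the kind of check I mean, here is a minimal plain-Java sketch. 
It does not use the Lucene API at all: the class StopFilterDemo, the method 
analyze() and the sample sentence are made up for illustration, and the stop 
set is only a small subset of Lucene's default English list. It just shows 
how "or" disappears when an English stop list is applied and survives when 
the stop set is empty.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical stand-in for an analyzer; NOT Lucene code.
class StopFilterDemo {
    // A few entries from Lucene's default English stop list (subset, for illustration).
    // Note that "or" is one of them.
    static final Set<String> ENGLISH_STOP_WORDS =
            new HashSet<String>(Arrays.asList("a", "an", "and", "of", "or", "the", "to"));

    // Lower-case the text, split on non-letters, and drop tokens found in stopWords.
    static List<String> analyze(String text, Set<String> stopWords) {
        List<String> tokens = new ArrayList<String>();
        for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{L}]+")) {
            if (!token.isEmpty() && !stopWords.contains(token)) {
                tokens.add(token);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        String text = "Le prix de l'or monte";
        // English stop list: "or" is silently removed.
        System.out.println(analyze(text, ENGLISH_STOP_WORDS));
        // Empty stop set: "or" is kept.
        System.out.println(analyze(text, Collections.<String>emptySet()));
    }
}
```

If the real analyzer prints "or" among its tokens but the search still 
returns nothing, the problem is more likely on the query side (for example 
the analyzer used when parsing the query) than at indexing time.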



Good Luck



2011-01-13



---------------------------------
Yinjie Hong
Ph. D. Student
College of Computer Science, Zhejiang University
 
Tel:       86-571-87952026
E-Mail: hongyj(at)zju(dot)edu(dot)cn
Office:  Room 400, Teaching Building #11, Yuquan Campus, Zhejiang University






Thank you for your reply. 

I am using my own FrenchAnalyzer for lexical analysis. It extends 
org.apache.lucene.analysis.Analyzer, and my stopword set is empty. 

Benoit 

On 2011-01-12 23:05, Robert Muir wrote: 
> On Wed, Jan 12, 2011 at 10:38 PM, Benoit Mercier 
> <benoit.merc...@member.fsf.org>  wrote: 
>> Hi, 
>> 
>> I have been happily using Lucene for several years to offer French lexical 
>> analysis tools to university researchers. Today, one of them decided to 
>> analyze the use of the French word "or" (meaning "gold" in French) in one of 
>> my corpora powered by Lucene... And, as you probably already guessed, no results... 
>> 
> What analyzer are you using? 
> 
> By default, StandardAnalyzer and StopAnalyzer use a set of English 
> stopwords. For French, this list is probably not appropriate. 
> If you look at the javadocs, you can pass in your own set of 
> stopwords... for lexical analysis maybe this should be an empty set. 
> 
> --------------------------------------------------------------------- 
> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org 
> For additional commands, e-mail: java-user-h...@lucene.apache.org 
> 
