On Wed, Apr 13, 2011 at 9:51 AM, Clemens Wyss <clemens...@mysign.ch> wrote:
> What I really want to do is ignore German stop words such as "der", "die",
> "das", "ein",...

GermanAnalyzer takes a stemExclusionSet. If you put those terms into
this set, the stemmer will not touch them. This should be in 3.1, I
think:

public GermanAnalyzer(Version matchVersion, Set<?> stopwords, Set<?> stemExclusionSet)
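
To make the roles of the two sets concrete, here is a minimal sketch in plain Java. It does not use Lucene at all: the class ToyGermanAnalysis, the methods analyze and toyStem, and the crude "strip one final e" rule are hypothetical stand-ins, not the real GermanAnalyzer or its stemmer. It only assumes what the parameter names suggest: terms in stopwords are dropped entirely, while terms in stemExclusionSet are kept and passed through unstemmed.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

public class ToyGermanAnalysis {

    // Hypothetical stand-in for a stemmer: strips a single final "e"
    // from terms longer than three characters. Not Lucene's algorithm.
    static String toyStem(String term) {
        if (term.length() > 3 && term.endsWith("e")) {
            return term.substring(0, term.length() - 1);
        }
        return term;
    }

    // Lowercases, splits on whitespace, drops stop words, and stems
    // every remaining term unless it is in the exclusion set.
    static List<String> analyze(String text, Set<String> stopwords,
                                Set<String> stemExclusionSet) {
        List<String> out = new ArrayList<String>();
        for (String raw : text.toLowerCase(Locale.ROOT).split("\\s+")) {
            if (raw.isEmpty() || stopwords.contains(raw)) {
                continue; // stop words never reach the output
            }
            out.add(stemExclusionSet.contains(raw) ? raw : toyStem(raw));
        }
        return out;
    }

    public static void main(String[] args) {
        Set<String> stopwords =
            new HashSet<String>(Arrays.asList("der", "die", "das", "ein"));
        Set<String> stemExclusionSet =
            new HashSet<String>(Arrays.asList("baguette"));
        // "die" and "ein" are dropped, "katze" loses its final e,
        // "baguette" is protected by the exclusion set.
        System.out.println(
            analyze("Die Katze und ein Baguette", stopwords, stemExclusionSet));
        // prints [katz, und, baguette]
    }
}
```

With the real analyzer, the same two sets would be passed as the second and third constructor arguments shown above.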

simon

>
>> -----Original Message-----
>> From: Robert Muir [mailto:rcm...@gmail.com]
>> Sent: Tuesday, April 12, 2011 17:03
>> To: java-user@lucene.apache.org
>> Subject: Re: German*Filter, Analyzer "cutting" off letters from (french)
>> words...
>>
>> On Tue, Apr 12, 2011 at 8:46 AM, Clemens Wyss <clemens...@mysign.ch>
>> wrote:
>> > Why so? Where have the e's gone?
>> >
>>
>> the e is being stemmed as it's a German suffix... all of the German stemming
>> algorithms remove final -e, as do all the French stemming algorithms.
>>
>> so I don't understand your problem.
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
>> For additional commands, e-mail: java-user-h...@lucene.apache.org
>
>

