On Wed, Apr 13, 2011 at 11:03 AM, Clemens Wyss <clemens...@mysign.ch> wrote: > I tried: > Set<String> stemsToBeIgnored = new HashSet<String>(Arrays.asList( "e" )); > GermanAnalyzer ga = new GermanAnalyzer( Version.LUCENE_31, > GermanAnalyzer.getDefaultStopSet(), stemsToBeIgnored );
try Arrays.asList("der", "die", "das", "ein"); or do I get you wrong.... simon > > But the e's are still "removed"... > >> -----Ursprüngliche Nachricht----- >> Von: Simon Willnauer [mailto:simon.willna...@googlemail.com] >> Gesendet: Mittwoch, 13. April 2011 10:51 >> An: java-user@lucene.apache.org >> Cc: Clemens Wyss >> Betreff: Re: German*Filter, Analyzer "cutting" off letters from (french) >> words... >> >> On Wed, Apr 13, 2011 at 9:51 AM, Clemens Wyss <clemens...@mysign.ch> >> wrote: >> > What I really want to do is ignore german stop words such as "der", "die", >> "das", "ein",... >> >> GermanAnalyzer takes a stemExclusionSet if you put those terms into this >> set the stemmer will not touch them. This should be in 3.1 I think >> >> public GermanAnalyzer(Version matchVersion, Set<?> stopwords, Set<?> >> stemExclusionSet) >> >> simon >> >> > >> >> -----Ursprüngliche Nachricht----- >> >> Von: Robert Muir [mailto:rcm...@gmail.com] >> >> Gesendet: Dienstag, 12. April 2011 17:03 >> >> An: java-user@lucene.apache.org >> >> Betreff: Re: German*Filter, Analyzer "cutting" off letters from >> >> (french) words... >> >> >> >> On Tue, Apr 12, 2011 at 8:46 AM, Clemens Wyss >> <clemens...@mysign.ch> >> >> wrote: >> >> > Why so? Where have the e's gone? >> >> > >> >> >> >> the e is being stemmed as its a german suffix... all of the german >> >> stemming algorithms remove final -e, as do all the french stemming >> algorithms. >> >> >> >> so i don't understand your problem. >> >> >> >> --------------------------------------------------------------------- >> >> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org >> >> For additional commands, e-mail: java-user-h...@lucene.apache.org >> > >> > > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org