: In order to do this, we tried subclassing the SnowballAnalyzer... it
: doesn't work yet, though. Here is the code of our custom class:

At first glance, what you've got seems fine. Can you elaborate on what
you mean by "it doesn't work"?

Perhaps the issue is that the SnowballStemmer can't handle the accented
characters, and you should strip them first, then stem?

  public TokenStream tokenStream(String fieldName, Reader reader) {
    TokenStream result = new StandardTokenizer(reader);
    result = new StandardFilter(result);
    result = new LowerCaseFilter(result);
    if (stopSet != null)
      result = new StopFilter(result, stopSet);
    // strip accents before the stemmer sees the tokens
    result = new ISOLatin1AccentFilter(result);
    result = new SnowballFilter(result, name);
    return result;
  }

-Hoss