: I am using the SnowballAnalyzer because of its multi-language stemming
: capabilities - and am very happy with that.
: There is one small glitch which I'm hoping to overcome - can I get it to split
: up internet domain names in the same way that StopAnalyzer does?
Most of the Lucene Analyzers that exist are simple wrappers around Tokenizers and TokenFilters -- this is true for SnowballAnalyzer and StopAnalyzer as well. All those classes do is some initialization work, and then they delegate to various Tokenizers and TokenFilters.

If you poke around in the code for SnowballAnalyzer, you'll see that you can write your own Analyzer that uses SnowballFilter along with whatever Tokenizer you want. (If you like StopAnalyzer's tokenization, that would be LowerCaseTokenizer.)

-Hoss
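
To make the "tokenizer plus filters" idea concrete, here is a rough, Lucene-free sketch in plain Java. It is not Lucene code: `letterTokens` only imitates the documented behaviour of LowerCaseTokenizer (split at non-letters, lower-case the result), `analyze` stands in for an Analyzer that delegates to a tokenizer and one filter, and `toyStem` is a hypothetical placeholder where SnowballFilter would sit, not the Snowball algorithm. All three names are made up for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.UnaryOperator;

class LetterPipelineSketch {

    // Stand-in for the tokenizer step: emit each maximal run of letters, lower-cased.
    static List<String> letterTokens(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetter(c)) {
                current.append(Character.toLowerCase(c));
            } else if (current.length() > 0) {
                // A non-letter (such as '.') ends the current token.
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    // Stand-in for the analyzer: tokenize, then pass every token through one filter.
    static List<String> analyze(String text, UnaryOperator<String> filter) {
        List<String> out = new ArrayList<>();
        for (String token : letterTokens(text)) {
            out.add(filter.apply(token));
        }
        return out;
    }

    // Hypothetical placeholder for the stemming step; NOT the Snowball algorithm.
    static String toyStem(String token) {
        if (token.length() > 3 && token.endsWith("s")) {
            return token.substring(0, token.length() - 1);
        }
        return token;
    }

    public static void main(String[] args) {
        // prints [visit, www, example, com, for, analyzer]
        System.out.println(analyze("Visit www.Example.com for analyzers",
                                   LetterPipelineSketch::toyStem));
    }
}
```

The point is that the domain name is broken apart by the tokenizer step alone, before any stemming happens, so swapping the tokenizer changes the splitting without touching the stemming filter. In a real Lucene Analyzer the same shape would presumably be a LowerCaseTokenizer wrapped in a SnowballFilter, with constructor details depending on your Lucene version.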