Analyzer question

Dan Armbrust Mon, 08 Aug 2005 07:43:58 -0700

It is my understanding that the StandardAnalyzer will remove underscores- so "some_word" be indexed as 'some' and 'word'.

I want to keep the underscores, so I was thinking of changing over to anAnalyzer that uses the WhiteSpaceTokenizer, LowerCaseFilter, and StopFilter.

What other tokenizing magic will I lose by changing away from theStandardAnalyzer?


Thanks,

Dan

--
****************************
Daniel Armbrust
Biomedical Informatics
Mayo Clinic Rochester
daniel.armbrust(at)mayo.edu
http://informatics.mayo.edu/


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Analyzer question

Reply via email to