> I am not sure if it is worth to add a new TokenFilter for Turkish language. > I see there exist GreekLowerCaseFilter and RussianLowerCaseFilter. It would > be nice to see TurkishLowerCaseFilter in Lucene. > > > just to clarify, GreekLowerCaseFilter really shouldn't exist either. The final sigma problem it has (where there are two lowercase forms depending upon position in word), this is also solved with unicode case folding or collation. This is a perfect example of how lowercase is the wrong operation for search.
and RussianLowerCaseFilter is deprecated now, it does the exact same thing as LowerCaseFilter. -- Robert Muir rcm...@gmail.com