Re: LowerCaseFilter fails one letter (I) of Turkish alphabet

Robert Muir Mon, 30 Nov 2009 11:09:21 -0800

> I am not sure if it is worth to add a new TokenFilter for Turkish language.
> I see there exist GreekLowerCaseFilter and RussianLowerCaseFilter. It would
> be nice to see TurkishLowerCaseFilter in Lucene.
>
>
>
just to clarify, GreekLowerCaseFilter really shouldn't exist either. The
final sigma problem it has (where there are two lowercase forms depending
upon position in word), this is also solved with unicode case folding or
collation. This is a perfect example of how lowercase is the wrong operation
for search.


and RussianLowerCaseFilter is deprecated now, it does the exact same thing
as LowerCaseFilter.

-- 
Robert Muir
rcm...@gmail.com

Re: LowerCaseFilter fails one letter (I) of Turkish alphabet

Reply via email to