Hi Robert, Thank you so much for the kind response and seems it's working fine...
Could you please ensure whether the below one restricts to the greek region alone? UnicodeSet unicodeSet = new UnicodeSet().applyPattern("[:Greek:]"); Normalizer2 base = Normalizer2.getInstance(ICUFoldingFilter.class. > getResourceAsStream("utr30.nrm"), "utr30", Normalizer2.Mode.COMPOSE); Normalizer2 normalizeFilter = new FilteredNormalizer2(base, unicodeSet); > TokenStream tok = new ICUNormalizer2Filter(tok, normalizeFilter); Kindly help me to resolve this. -- Regards, Chitra