> Thanks for the quick answer. :-) > > So can I say, for ArabicAnalyzer, generally it can tokenize the mixed > content with Arabic and English? :-) > > I am not really familiar with Arabic language. What do you mean for > "change > Arabic tokens"? Does Arabic has something like upper/lower case as English > does?
For the arabic anayzer this works, because you can detect the "language" easy from the used characters. But then it stems only the Arabic part. The English one is simply untouched. But a analyzer that should automatically detect English, French, German on the index and search side (see my email before) is almost impossible to create. Uwe --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org