Re: Indexing .txt file containing english, german or french alphabet

2005-09-26 Thread Ian Soboroff
Otis Gospodnetic <[EMAIL PROTECTED]> writes: > For indexing text that has multiple languages I don't know what to > recommend. Well, I do - try the StandardAnalyzer and see if that > produces satisfactory results, but you'd really need a smart analyzer > that knows how to properly tokenize an

Re: Indexing .txt file containing english, german or french alphabet

2005-09-25 Thread Otis Gospodnetic
For dealing with parsing + indexing RTF, see chapter 7 of Lucene in Action. For indexing text that has multiple languages I don't know what to recommend. Well, I do - try the StandardAnalyzer and see if that produces satisfactory results, but you'd really need a smart analyzer that knows how

Indexing .txt file containing english, german or french alphabet

2005-09-25 Thread tirupathi reddy
Hello, I have to index the text in the .txt document. This text document contains english characters , german characters etc. Please tell me how can I index that text document. Is the procedure of indexing RTF documents can be applied here? thanx, MTREDDY Tirupati Reddy Manyam 24-06-08