Re: Lucene Arabic Internationalization Question

Nader Henein Fri, 27 May 2005 14:20:10 -0700

Dear Rasha,

Sorry for the delay, I've indexed Arabic and English seamlessly onLucene, the only thing you have to watch out for is stemming, as forindexing PDFs, I have not used that part of the API, but fromexperience, this comes down to using or in some cases forcing thecorrect encoding, debug this by bringing down your development to thelowest denominator, for example if you're doing this from a webservice,try it first from the prompt, so you have to contend only with the OSencoding (UTF-8 is highly recommended) and not the browser / serverencodings.

A more detailed example of the problem you're facing would help meunderstand the problem more.


Nader

Rasha wrote:

Dear Nader,

I Have a big problem during indexing pdfs containing Persian Word

lucenePDFIndexer cannot index it , and indexed words of pdf are unuseable


is there a way to perform it to index good?


regards,
rasha malek


--

Nader S. Henein
Senior Applications Architect

Bayt.com





---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Lucene Arabic Internationalization Question

Reply via email to