RE: Multiple Language Indexing and Searching

James Adams Tue, 06 Sep 2005 05:32:44 -0700

Surely it's best to have a specific analyzer for each language?

Would support for multiple Analyzers with a single index require a
different IndexWriter for each Analzser/language?  Would you then need
to manage the disk access of these regarding locking etc, so two
IndexWriter's can not do so at the same time?





-----Original Message-----
From: Olivier Jaquemet [mailto:[EMAIL PROTECTED] 
Sent: 06 September 2005 13:21
To: [email protected]
Subject: Re: Multiple Language Indexing and Searching

Gusenbauer Stefan wrote:

>I think nutch uses ngramj for language classification but i don't know
>what type of saving language information they use. In our application
>for example i save the language in an extra field in the document
>because lucene is supporting multiple fields with the same names we
>would be able to handle different languages. but for now we don't need
it
>  
>
But then, if you do so, you do not benefit from any specialized Analyzer

you could use for each language, do you?
Then again, maybe it's not that interesting to use specialized analyzers

for each language?.



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

RE: Multiple Language Indexing and Searching

Reply via email to