Hi all,
DSpace (www.dspace.org) currently uses Lucene to index metadata
(Dublin Core standard) and extracted full-text content of documents
stored in it. Now that the system is being used globally, it needs to
support multi-language indexing.
I've looked through the mailing list archives etc. and
Thanks all for the useful comments.
It seems that there are even more options --
4/ One index, with a separate Lucene document for each (item,language)
combination, with one field that specifies the language
5/ One index, one Lucene document per item, with field names that include the
language
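
To make the difference between options 4 and 5 concrete, here is a rough
sketch of the two layouts. It deliberately does not use the Lucene API: each
"document" is modelled as a plain map of field name to value, so it only
illustrates the field layout. The class name, the method names, and the field
names (item_id, language, title, title_<lang>) are all made up for
illustration and are not taken from DSpace or Lucene.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: plain maps stand in for Lucene documents.
public class MultiLanguageLayouts {

    // Option 4: one document per (item, language) combination,
    // with a dedicated field that records the language.
    static List<Map<String, String>> perLanguageDocs(
            String itemId, Map<String, String> titlesByLang) {
        List<Map<String, String>> docs = new ArrayList<>();
        for (Map.Entry<String, String> e : titlesByLang.entrySet()) {
            Map<String, String> doc = new LinkedHashMap<>();
            doc.put("item_id", itemId);        // assumed field name
            doc.put("language", e.getKey());   // assumed field name
            doc.put("title", e.getValue());
            docs.add(doc);
        }
        return docs;
    }

    // Option 5: one document per item, with the language encoded
    // in the field name (e.g. title_en, title_es).
    static Map<String, String> singleDoc(
            String itemId, Map<String, String> titlesByLang) {
        Map<String, String> doc = new LinkedHashMap<>();
        doc.put("item_id", itemId);
        for (Map.Entry<String, String> e : titlesByLang.entrySet()) {
            doc.put("title_" + e.getKey(), e.getValue());
        }
        return doc;
    }

    public static void main(String[] args) {
        Map<String, String> titles = new LinkedHashMap<>();
        titles.put("en", "Digital libraries");
        titles.put("es", "Bibliotecas digitales");

        System.out.println(perLanguageDocs("123", titles));
        System.out.println(singleDoc("123", titles));
    }
}
```

With option 4, a search can return the same item more than once (one hit per
language), so results would need to be collapsed on the item identifier. With
option 5, each item appears once, but a query has to name the
language-specific field it wants to search.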