Reconstructing a field from an index is
1> slow
2> lossy (what about stemmed words? stopwords? )

UNLESS you have stored the data (Field.Store.YES/COMPRESS),
in which case you can just get the field from each index and put it
in the new one. Tokenization has little to do with this although you
could get a similar effect with untokenized fields but why would you
want to?

I assume you have a way to uniquely identify the documents that you
want to combine, relying on the Lucene doc ID is fragile....

If possible, your best bet would be to reconstruct the new index
from the source you used to create the original indexes.

Maybe a higher-level problem statement would help generate
more suggestions.

Best
Erick

On Thu, Feb 26, 2009 at 7:07 AM, liat oren <oren.l...@gmail.com> wrote:

> Hi,
>
> I have two indexes, each has a tokenized field and I would like to combine
> them both into one field in a new index.
> How can it be done?
> (Is it a good approach or is it better to hold them as untokenized text and
> only when I create the new index, then to tokenize it?)
>
> Many thanks,
> Liat
>

Reply via email to