RE: distributing the indexing process

2011-06-30 Thread Guru Chandar
index is that big that you need to index it on different hosts, are you sure you want it to be combine in a single index? Maybe it's a good idea to partition it? On Thu, Jun 30, 2011 at 12:12, Guru Chandar wrote: > > > If we have to index a lot of documents, is there a way to div

distributing the indexing process

2011-06-30 Thread Guru Chandar
If we have to index a lot of documents, is there a way to divide the documents into multiple sets and index them on multiple machines in parallel, and then merge the resulting indexes back into a single machine? If yes, will the result be logically equivalent to indexing all the documents on a s