Hi, I saw that article and it tells me that increasing the mergeFactor speeds up the indexing. But the reverse had happened in my case. To be more specific I had conducted some experiments for 1000 documents. The time taken is quite large, due to pdf file indexing. I had changed the indexwriter's parameters.
MergeFactor – default(10) minMergeDocs – default(10) Time taken – 690 sec MergeFactor – 50 minMergeDocs – default(10) Time taken – 765 sec MergeFactor – default(10) minMergeDocs – 100 Time taken – 670 sec MergeFactor –100 minMergeDocs – 100 Time taken – 738 sec Increasing the mergeFactor did not speed up, but increasing the minMergeDocs had improved. I am using Lucene.Net. Can you explain the behavior. I am confused On 2/10/07, Otis Gospodnetic <[EMAIL PROTECTED]> wrote:
Sairaj, see http://www.onjava.com/pub/a/onjava/2003/03/05/lucene.html Increase your maxBufferedDocs. Otis ----- Original Message ---- From: Sairaj Sunil <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Friday, February 9, 2007 11:14:50 AM Subject: Merge factor problem, Hi all, I have increased the merge factor from 10 to 50. I thought the indexing performance will be better. But the time taken taken to index is more than the time taken for the merge factor of 10. The documentation and some articles say that the time taken to index will improve if the merge factor is increased. I have changed the merge factors to 50, 100, 1000. I have left the minMergeDocs to be the default value for all the cases. The time taken to index same number of documents increased in a linear fashion, which is exactly opposite according to the info I have read. Is this the correct behavior. In which cases this behavior happens? Regards -- Sairaj Sunil --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
-- Sairaj Sunil II Mtech(CS) SSSIHL Prashanthi Nilayam