Hi,
I saw that article and it tells me that increasing the mergeFactor speeds up
the indexing. But the reverse had happened in my case.
To be more specific I had conducted some experiments for 1000 documents. The
time taken is quite large, due to pdf file indexing. I had changed the
indexwriter's parameters.

MergeFactor – default(10)
minMergeDocs – default(10)
Time taken – 690 sec

MergeFactor – 50
minMergeDocs – default(10)
Time taken – 765 sec
MergeFactor – default(10)
minMergeDocs – 100
Time taken – 670 sec

MergeFactor –100
minMergeDocs – 100
Time taken – 738 sec
Increasing the mergeFactor did not speed up, but increasing the minMergeDocs
had improved. I am using Lucene.Net.
Can you explain the behavior. I am confused

On 2/10/07, Otis Gospodnetic <[EMAIL PROTECTED]> wrote:

Sairaj, see http://www.onjava.com/pub/a/onjava/2003/03/05/lucene.html

Increase your maxBufferedDocs.

Otis

----- Original Message ----
From: Sairaj Sunil <[EMAIL PROTECTED]>
To: java-user@lucene.apache.org
Sent: Friday, February 9, 2007 11:14:50 AM
Subject: Merge factor problem,

Hi all,
I have increased the merge factor from 10 to 50. I thought the indexing
performance will be better. But the time taken taken to index is more than
the time taken for the merge factor of 10. The documentation and some
articles say that the time taken to index will improve if the merge factor
is increased.
I have changed the merge factors to 50, 100, 1000. I have left the
minMergeDocs to be the default value for all the cases. The time taken to
index same number of documents increased in a linear fashion, which is
exactly opposite according to the info I have read.
Is this the correct behavior. In which cases this behavior happens?

Regards
--
Sairaj Sunil




---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]




--
Sairaj Sunil
II Mtech(CS)
SSSIHL
Prashanthi Nilayam

Reply via email to