There is nothing in core lucene to do this and I don't recall seeing
anything in contrib. One approach would be to loop through all the
docs in the second index deleting them if present in first index,
commit that change, then merge the two indexes.
--
Ian.
On Mon, Feb 20, 2012 at 11:58 AM, Kar
Hi to you,
i've got a question concerning the merge of multiple indexes...
Suppose we have two indexes which contain the same content ...than the
result of a indexMerge is to have an index which contains the result
doubled..ok so far so good ...
But is there any chance/configuration that Lucen
Waits of several hours on a 4Gb index sounds very unlikely. Are you
sure there isn't something else going on that is blocking things?
What version of lucene? Decent, error-free, hardware?
As for optimize, I'd skip it altogether, or schedule it occasionally
when there is no or low activity on the
Hi,
Our Lucene index grew to about 4 GB .
Unfortunately it brought up a performance problem of slow file merging.
We have:
1. A writer thread: once an Hour it looks for modified documents, and
updates the Lucene index.
Usually there are only few modifications, but sometimes we switch the
entire co
Well ... yes and no?
Yes, the Log*MergePolicy will still at certain times merge the index
all the way down to one segment. If mergeFactor is 10 then this will
happen every "power of 10" flushed segments. Ie, after 10 flushes a
merge will merge them down to 1 segment, then after 100 flush
Thanks a lot Mike...one more question:
I remember reading that a regular addDocument call could basically
trigger an optimize on a given call. Is this true? Maybe not true anymore?
It doesnt sound right to me, but I do remember reading about it. This
was pre background merging when it was men
Yes this should reduce transient (while merging) disk usage.
However, optimize disregards this parameter, so it will still use the
same disk space. However, if you call optimize(N) then that should
use less space since it does not merge all the way down to 1 segment.
Note that the limit
If I use LogByteSizeMergePolicy#setMaxMergeMB, can I clamp down on the
space needed for optimize/merge? My thought is, if a segment is maxed
out, it will never need to be copied for a merge right? So you could
significantly reduce merge/optimize space requirments (now at like 2x-4x
if readers c
> But it also seems that the parallel/not parallel decision is
> something you control on the back end, so I'm not sure the user
> is involved in the merge question at all. In other words, you could
> easily split the indexing task up amongst several machines and/or
> processes and combine all the
OK, I think I'm getting a better handle here. I can't imagine how
it would work to combine indexes that use *different* analyzers
on the *same* field. Regardless of what Lucene did, you
simply could NOT explain this to a user. To take a simple example,
index part of your data for field1 with Keywor
> Then why would you want to combine them?
>
> I really think you need to explain what you're trying to accomplish
> rather then obsess on the details.
I have to create indexes in parallel because the amount of data is very
high.
Then I want to merge them into bigger indexes an move them to the s
Then why would you want to combine them?
I really think you need to explain what you're trying to accomplish
rather then obsess on the details.
Erick
On Jan 14, 2008 10:17 AM, <[EMAIL PROTECTED]> wrote:
> > I admit I've never used IndexMergeTool, I've always used
> > IndexWriter.AddIndexex and
> I admit I've never used IndexMergeTool, I've always used
> IndexWriter.AddIndexex and then execute
> IndexWriter.optimize().
>
> And I've seen no problems. That call takes no
> analyzer.
So you take the first index an add a remaining indexes via addIndexes?
What happens if the indexes were crea
I admit I've never used IndexMergeTool, I've always used
IndexWriter.AddIndexex and then execute
IndexWriter.optimize().
And I've seen no problems. That call takes no
analyzer.
Erick
On Jan 14, 2008 6:12 AM, <[EMAIL PROTECTED]> wrote:
> > See org.apache.lucene.misc.IndexMergeTool
>
> Thank you.
> See org.apache.lucene.misc.IndexMergeTool
Thank you.
But this uses a hardcoded analyzer and deprecated API-Calls.
How does the used analyzer effect the merge process?
Is everything reindexed with this new analyzer again? Does this make sense?
What if the sources indexes had other analyzers us
See IndexWriter.AddIndexes.
See org.apache.lucene.misc.IndexMergeTool
Erick
On Jan 13, 2008 12:10 PM, <[EMAIL PROTECTED]> wrote:
> Hi,
>
> are there any ready to use tools out there which I can use for merging and
> optimzing?
>
> I have seen that Luke can optimize, but not merge?
>
> Or do I ha
Hi,
are there any ready to use tools out there which I can use for merging and
optimzing?
I have seen that Luke can optimize, but not merge?
Or do I have to write my own utility?
Thank you
-
To unsubscribe, e-mail: [EMAIL PRO
Omar Didi wrote:
I have tried to use the isCurrent() method IndexReader to figure out
if an index is merging. but since I have to do this evrytime I need
to add a document, the performance got s slow.
here is what I am doing, I create 4 indexs and I am running with 4
threads. I do a round r
rging?
thanks for any hints,
- Omar
-Original Message-
From: Yonik Seeley [mailto:[EMAIL PROTECTED]
Sent: Monday, February 06, 2006 10:03 AM
To: java-user@lucene.apache.org
Subject: Re: index merging
On 2/6/06, Vanlerberghe, Luc <[EMAIL PROTECTED]> wrote:
> Sorry to contradict you
On 2/6/06, Vanlerberghe, Luc <[EMAIL PROTECTED]> wrote:
> Sorry to contradict you Yonik, but I'm pretty sure the commit lock is
> *not* locked during a merge, only while the "segments" file is being
> updated.
Oops, you're right. Good thing too... if the commit lock was held
during merges, one co
ile yet.
Luc
-Original Message-
From: Yonik Seeley [mailto:[EMAIL PROTECTED]
Sent: donderdag 2 februari 2006 22:25
To: java-user@lucene.apache.org
Subject: Re: index merging
On 2/2/06, Omar Didi <[EMAIL PROTECTED]> wrote:
> Thanks Yonik,
> I can't set the merge factor to
On 2/2/06, Omar Didi <[EMAIL PROTECTED]> wrote:
> Thanks Yonik,
> I can't set the merge factor too high because I will end up with the too many
> files open problem.
Right. I meant only for adding a lot of documents. After a lot of
adds, then you could set the mergefactor back down to a reasona
eeley [mailto:[EMAIL PROTECTED]
Sent: Thursday, February 02, 2006 3:53 PM
To: java-user@lucene.apache.org
Subject: Re: index merging
No, there isn't anything in the API to tell you that.
A merge may be triggered on any IndexWriter.add() call.
If you want to avoid merges, you can set the merge fac
No, there isn't anything in the API to tell you that.
A merge may be triggered on any IndexWriter.add() call.
If you want to avoid merges, you can set the merge factor really high
so that merges will never happen, and set maxBufferedDocs to the size
of the segments you want.
-Yonik
On 2/2/06, Om
Hi all,
I am trying to index in 2 different indexes with different mergefactors. so if
one of the indexes is merging than I want to index only on the other index.
is there a way to know if an index is merging?.
thanks
-
To uns
Hello all,
Whilst merging one index into another using
IndexWriter.addIndexes(IndexReader[]) I got the following error.
(index _file_path)\_5z.fnm (The system cannot find the file specified)
It would appear that this occurred during the adding of the indexes. The
indexes I was merging to an
26 matches
Mail list logo