Re: Removing duplicate entries

2008-04-30 Thread João Rodrigues
>Probably something very like that, although you see none of that. Just >doing a deleteDocument(term) does it all for you. And I learned long ago >that the folks who write this kind of stuff can probably do it more >efficiently >than I can . And probably more efficiently that I can as well :) Than

Re: Removing duplicate entries

2008-04-30 Thread Erick Erickson
See below: On Tue, Apr 29, 2008 at 9:51 PM, João Rodrigues <[EMAIL PROTECTED]> wrote: > First of all, let me apologize for the double post but I got some strange > error message =\ > > >The first question is what do you mean the document > >is already in the index? Lucene doc IDs are useless > >h

Re: Removing duplicate entries

2008-04-29 Thread João Rodrigues
First of all, let me apologize for the double post but I got some strange error message =\ >The first question is what do you mean the document >is already in the index? Lucene doc IDs are useless >here since the ones in your FSDir and the ones in your >RAMdir are unrelated. In fact, I suspect tha

Re: Removing duplicate entries

2008-04-29 Thread Erick Erickson
The first question is what do you mean the document is already in the index? Lucene doc IDs are useless here since the ones in your FSDir and the ones in your RAMdir are unrelated. In fact, I suspect that the lucene docIDs will start at the same number in both. Lucene doc IDs are just monotonicall

Fwd: Removing duplicate entries

2008-04-29 Thread João Rodrigues
Hello all. Before I ask my question, I'd like to clarify I've read the manual and searched the archives, and if I'm here, it is because I've neither found a suitable answer, or (most likely) I didn't understand those which I did find :) I have an index built, which I update regularly. However, the

Removing duplicate entries

2008-04-29 Thread João Rodrigues
Hello all. Before I ask my question, I'd like to clarify I've read the manual and searched the archives, and if I'm here, it is because I've neither found a suitable answer, or (most likely) I didn't understand those which I did find :) I have an index built, which I update regularly. However, the