Re: delete by docid in lucene 4

2012-07-12 Thread Sean Bridges
Thanks for the advice everyone, I'll try updateDocument() for now. Sean On Thu, Jul 12, 2012 at 3:25 PM, Michael McCandless wrote: > On Thu, Jul 12, 2012 at 6:17 PM, Simon Willnauer > wrote: >> Sean seriously a couple of hundred docs a second, don't bother just >> use updateDocument. My benchma

Re: delete by docid in lucene 4

2012-07-12 Thread Michael McCandless
On Thu, Jul 12, 2012 at 6:17 PM, Simon Willnauer wrote: > Sean seriously a couple of hundred docs a second, don't bother just > use updateDocument. My benchmarks show that there is only a smallish > impact during indexing especially with concurrent flushing in lucene > 4. I don't know how resource

Re: delete by docid in lucene 4

2012-07-12 Thread Simon Willnauer
ll not come back. > > Uwe > > - > Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > >> -Original Message- >> From: Sean Bridges [mailto:sean.brid...@gmail.com] >> Sent: Thursday, July 12, 2012 9:

RE: delete by docid in lucene 4

2012-07-12 Thread Uwe Schindler
remen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Sean Bridges [mailto:sean.brid...@gmail.com] > Sent: Thursday, July 12, 2012 9:51 PM > To: java-user@lucene.apache.org; simon.willna...@gmail.com > Subject: Re: delete by docid in l

Re: delete by docid in lucene 4

2012-07-12 Thread Sean Bridges
>> ID). >>> >>> Uwe >>> >>> - >>> Uwe Schindler >>> H.-H.-Meier-Allee 63, D-28213 Bremen >>> http://www.thetaphi.de >>> eMail: u...@thetaphi.de >>> >>> >>>> -Original Message- >>>

Re: delete by docid in lucene 4

2012-07-12 Thread Simon Willnauer
eMail: u...@thetaphi.de >> >> >>> -Original Message- >>> From: Sean Bridges [mailto:sean.brid...@gmail.com] >>> Sent: Thursday, July 12, 2012 5:42 PM >>> To: java-user@lucene.apache.org; simon.willna...@gmail.com >>> Subject: Re: d

RE: delete by docid in lucene 4

2012-07-12 Thread Edward W. Rouse
> Sent: Thursday, July 12, 2012 11:50 AM > To: java-user@lucene.apache.org > Subject: Re: delete by docid in lucene 4 > > Does that return a Term which matches the lucene docId? What is the > value of Constants.DEFAULT_ID_FIELD ? > > Thanks, > Sean > > On Thu, J

Re: delete by docid in lucene 4

2012-07-12 Thread Sean Bridges
; To: java-user@lucene.apache.org; simon.willna...@gmail.com >> Subject: Re: delete by docid in lucene 4 >> >> We have indexer machines which are fed documents by other machines. >> If an error occurs (machine crashing etc) the same document may be sent to > an >> indexe

RE: delete by docid in lucene 4

2012-07-12 Thread Uwe Schindler
day, July 12, 2012 5:42 PM > To: java-user@lucene.apache.org; simon.willna...@gmail.com > Subject: Re: delete by docid in lucene 4 > > We have indexer machines which are fed documents by other machines. > If an error occurs (machine crashing etc) the same document may be sent to an

Re: delete by docid in lucene 4

2012-07-12 Thread Sean Bridges
Does that return a Term which matches the lucene docId? What is the value of Constants.DEFAULT_ID_FIELD ? Thanks, Sean On Thu, Jul 12, 2012 at 6:54 AM, Edward W. Rouse wrote: > I get around this by creating an id based term like: > > new Term(Constants.DEFAULT_ID_FIELD, id) > >> -Original

Re: delete by docid in lucene 4

2012-07-12 Thread Sean Bridges
We have indexer machines which are fed documents by other machines. If an error occurs (machine crashing etc) the same document may be sent to an indexer multiple times. Serial ids are assigned before documents reach the indexer, so a document, may be in the index multiple times, each time with th

RE: delete by docid in lucene 4

2012-07-12 Thread Edward W. Rouse
I get around this by creating an id based term like: new Term(Constants.DEFAULT_ID_FIELD, id) > -Original Message- > From: Sean Bridges [mailto:sean.brid...@gmail.com] > Sent: Wednesday, July 11, 2012 9:09 PM > To: java-user@lucene.apache.org > Subject: delete by docid in lucene 4 > > Is

Re: delete by docid in lucene 4

2012-07-12 Thread Simon Willnauer
On Thu, Jul 12, 2012 at 3:09 AM, Sean Bridges wrote: > Is it possible to delete by docId in lucene 4? I can delete by docid > in lucene 3 using IndexReader.deleteDocument(int docId), but that > method is gone in lucene 4, and IndexWriter only allows deleting by > Term or Query. that is correct.