Re: checking existing docs before indexing

2007-07-12 Thread Neeraj Gupta
provided. Cheers, Neeraj "Samuel LEMOINE" <[EMAIL PROTECTED]> 07/12/2007 09:38 PM Please respond to java-user@lucene.apache.org To java-user@lucene.apache.org cc [EMAIL PROTECTED] Subject Re: checking existing docs before indexing Neeraj Gupta a écrit : >

Re: checking existing docs before indexing

2007-07-12 Thread Samuel LEMOINE
Neeraj Gupta a écrit : Hi, You an use updateDocument() method of IndexWriter to update any existing document.. It searches for a document matching the Term, if document existes then delete that document. After that it adds the provided document to the indexes in both the cases whether documen

Re: checking existing docs before indexing

2007-07-12 Thread Neeraj Gupta
Hi, You an use updateDocument() method of IndexWriter to update any existing document.. It searches for a document matching the Term, if document existes then delete that document. After that it adds the provided document to the indexes in both the cases whether document exists or not. Cheers,

Re: checking existing docs before indexing

2007-07-12 Thread Erick Erickson
You have to check yourself. Lucene has no concept of relations *between* documents. What you're really asking for is something like a database unique key. No such luck, you have to create one yourself. What I've done is post-process the entire index, removing duplicates. This can be done quite ef