Thanks Erick, Using termdocs/termenum should work. One of my concerns is the performance: the search results could reach 100K, so the performance may be impacted. One of the alternative I am thinking is to collapse the data during indexing time, but I haven't decided to go that way.
----- Original Message ---- From: Erick Erickson <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Thursday, December 14, 2006 5:49:01 PM Subject: Re: Duplicates removal in search results you need to search for all documents with the title you care about, decide which one to keep and remove all the others. You'll probably need a TermDocs/TermEnum to go through all the items in your index to create the list of documents to remove. Erick On 12/14/06, qaz zaq <[EMAIL PROTECTED]> wrote: > > How can i remove the duplicates records in the search results. i.e., I > have multiple results with the same title in 'title' field, and I want to > only 1 record per title, how can I achieve that? thanks!! > > > --------------------------------- > Everyone is raving about the all-new Yahoo! Mail beta. > ____________________________________________________________________________________ Any questions? Get answers on any topic at www.Answers.yahoo.com. Try it now.