RE: Preventing merging by IndexWriter

2006-10-19 Thread Johan Stuyts
> I just searched for 'faceted' on the e-mails I've seen since I subscribed to
> the list, and there are certainly discussions out there...
I did already, but...
> This thread might be particularly useful, started 15-May-2006
> *Aggregating category hits
it seems I missed this one. Thanks.

Re: Preventing merging by IndexWriter

2006-10-18 Thread Erick Erickson
Your problem is out of my experience, so all I can suggest is that you search the list archive. I know the idea of faceted searching has been discussed by people with waaay more experience in that realm than I have and, as I remember, there were some links provided. I just searched for 'facete

RE: Preventing merging by IndexWriter

2006-10-18 Thread Johan Stuyts
> > So my questions are: is there a way to prevent the IndexWriter from
> > merging, forcing it to create a new segment for each indexing batch?
>
> Already done in the Lucene trunk:
> http://issues.apache.org/jira/browse/LUCENE-672
>
> Background:
> http://www.gossamer-threads.com/lists/lucene/j

RE: Preventing merging by IndexWriter

2006-10-18 Thread Johan Stuyts
> Why go through all this effort when it's easy to make your own unique ID?
> Add a new field to each document "myuniqueid" and fill it in yourself. It'll
> never change then.
I am sorry I did not mention in my post that I am aware of this solution but that it cannot be used for my purposes.
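For readers new to the thread, the quoted "myuniqueid" suggestion can be pictured with a toy model. This is a sketch for illustration only and does not use Lucene's API: `ToyIndex`, `ToyDoc`, and every method below are hypothetical names. It assumes, as the thread does, that an internal document number is just a position that can shift once deleted documents are merged away, while an ID stored as a field by the application stays the same.

```java
import java.util.ArrayList;
import java.util.List;

// Toy model only: NOT Lucene's API. All names here are hypothetical.
public class ToyIndex {

    // A document carrying an application-assigned ID in a field.
    static final class ToyDoc {
        final String myUniqueId;
        boolean deleted;

        ToyDoc(String myUniqueId) {
            this.myUniqueId = myUniqueId;
        }
    }

    private final List<ToyDoc> docs = new ArrayList<>();

    public void add(String myUniqueId) {
        docs.add(new ToyDoc(myUniqueId));
    }

    // Marks matching documents as deleted; positions do not change yet.
    public void deleteByUniqueId(String id) {
        for (ToyDoc d : docs) {
            if (d.myUniqueId.equals(id)) {
                d.deleted = true;
            }
        }
    }

    // Simulated merge: deleted documents are dropped, so later ones are renumbered.
    public void merge() {
        docs.removeIf(d -> d.deleted);
    }

    // Internal number = current position in the list, or -1 if absent or deleted.
    public int internalNumberOf(String id) {
        for (int i = 0; i < docs.size(); i++) {
            ToyDoc d = docs.get(i);
            if (!d.deleted && d.myUniqueId.equals(id)) {
                return i;
            }
        }
        return -1;
    }

    public static void main(String[] args) {
        ToyIndex index = new ToyIndex();
        index.add("a");
        index.add("b");
        index.add("c");

        int before = index.internalNumberOf("c");
        index.deleteByUniqueId("a");
        index.merge();
        int after = index.internalNumberOf("c");

        // prints "internal number of c: 2 -> 1"
        System.out.println("internal number of c: " + before + " -> " + after);
    }
}
```

The lookup by "myuniqueid" still finds document "c" after the merge even though its position moved, which is the property the quoted suggestion relies on. The cost of such a lookup in a real index is the separate question raised elsewhere in the thread.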

Re: Preventing merging by IndexWriter

2006-10-17 Thread Erick Erickson
True. But is it enough faster than TermDocs.seek(new Term("unique id", id)).doc() to be worth the complication for this situation? ...
Erick
On 10/17/06, Daniel Noll <[EMAIL PROTECTED]> wrote:
> Erick Erickson wrote:
> > Why go through all this effort when it's easy to make your own unique ID? I

Re: Preventing merging by IndexWriter

2006-10-17 Thread Daniel Noll
Erick Erickson wrote:
> Why go through all this effort when it's easy to make your own unique ID?
I can think of one reason: hits.id() is orders of magnitude faster than hits.doc().
Daniel
--
Daniel Noll
Nuix Pty Ltd
Suite 79, 89 Jones St, Ultimo NSW 2007, Australia
Ph: +61 2 9280 0699 W

Re: Preventing merging by IndexWriter

2006-10-17 Thread Erick Erickson
Why go through all this effort when it's easy to make your own unique ID? Add a new field to each document "myuniqueid" and fill it in yourself. It'll never change then. The complex coordination way. To coordinate things, you could keep the last ID used (and maybe other information) in a unique

Re: Preventing merging by IndexWriter

2006-10-17 Thread Yonik Seeley
On 10/17/06, Johan Stuyts <[EMAIL PROTECTED]> wrote:
> So my questions are: is there a way to prevent the IndexWriter from merging, forcing it to create a new segment for each indexing batch?
Already done in the Lucene trunk: http://issues.apache.org/jira/browse/LUCENE-672
Background: http://www

Re: Preventing merging by IndexWriter

2006-10-17 Thread Erick Erickson
Ignore the bit about keeping the mappings, it's too tricky unless really really necessary, since by virtue of updating the meta-data document, you'll delete a document, thus perhaps changing the Lucene IDs. I should proofread before hitting the "send" button ...
Erick
On 10/17/06, Erick Erickso