date:20060225

Re: Indexing performance with Lucene 1.9

2006-02-25 Thread Daniel Naber

On Samstag 25 Februar 2006 14:20, Eric Jain wrote: > After upgrading to Lucene 1.9, an index that used to take about 9h to > build now requires 13h. Any one else notice a decrease in performance? Yes, I can reproduce this with the Lucene demo on a much smaller index of 2000 documents. It (partly

Re: Frequency of phrase

2006-02-25 Thread Chris Hostetter

: > If you use a span query then you can get the actual number of phrase : > instances. : : Thanks, good to know! Just to clarify, i Doug means you can use the getSpans method of a SpanNear query, and then count the iterations of next() untill you run out. Which is a really good idea I hadn't t

Re: Frequency of phrase

2006-02-25 Thread Eric Jain

Doug Cutting wrote: If you use a span query then you can get the actual number of phrase instances. Thanks, good to know! In this case (need to suggest phrase queries to the user) I've now settled with dividing the number of hits for a potential phrase by the number of documents that contain

Indexing performance with Lucene 1.9

2006-02-25 Thread Eric Jain

After upgrading to Lucene 1.9, an index that used to take about 9h to build now requires 13h. Any one else notice a decrease in performance? This is how I configure the IndexWriter: writer = new IndexWriter(dir, analyzer, false); writer.mergeFactor = 100; writer.minMergeDocs = 100; writ