Re: why the of advance(int target) function of DocIdSetIterator is defined with uncertain?

2012-04-18 Thread Li Li
Michael McCandless wrote: So... the good news is I made a new scorer (basically copied DisjunctionMaxScorer and then tweaked from there) that scores the OR-only case. All tests pass w/ this new scorer. And more good news is that if you don't score (I sort by doctitle to do that), you get a speedu

Re: Slow merging after upgrading to 3.5

2012-04-18 Thread Michael McCandless
Super, thanks for bringing closure! Mike McCandless http://blog.mikemccandless.com On Wed, Apr 18, 2012 at 5:33 PM, Ivan Brusic wrote: > Just wanted to circle back and report on our progress. > > We finally applied the settings to our production environment and the > improvements have been dram

Re: Slow merging after upgrading to 3.5

2012-04-18 Thread Ivan Brusic
Just wanted to circle back and report on our progress. We finally applied the settings to our production environment and the improvements have been dramatic. Our indexing time has returned to 2.3 levels. Thanks again, Ivan On Fri, Apr 6, 2012 at 11:36 AM, Michael McCandless wrote: > On Thu, Ap

RE: Field value vs TokenStream

2012-04-18 Thread Uwe Schindler
Hi, You should inform yourself about the difference between "stored" and "indexed" fields: The tokens in the ".tis" file are in fact the analyzed tokens retrieved from the TokenStream. This is controlled by the Field parameter Field.Index. The Field.Store parameter has nothing to do with indexing:

Field value vs TokenStream

2012-04-18 Thread Carsten Schnober
Dear list, I'm studying the Lucene index file formats and I wonder: after having initialized a field with Field(String name, String value, Field.Store store, Field.Index index), where is the value String stored? I understand that the chosen analyzer does its processing on that value, including tok