RE: lucene Indexer failed to close, but later indexing still OK?

2012-08-06 Thread Zhang, Lisheng
Hi, Since I last reported the issue below, I have had a few similar issues. For example, once when I ran CheckIndex I got an error saying that some delete file was not found, but a few seconds later CheckIndex was 100% OK. Is this a problem that only happens in 2.3.2 and was fixed in a later release (sometimes data is

Re: Small Vocabulary

2012-08-06 Thread Mike Sokolov
There was some interesting work done on optimizing queries including very common words (stop words) that I think overlaps with your problem. See this blog post http://www.hathitrust.org/blogs/large-scale-search/slow-queries-and-common-words-part-2 from the Hathi Trust. The upshot in a nutshel

[ANNOUNCE] Lucene/Solr @ ApacheCon Europe - August 13th Deadline for CFP and Travel Assistance applications

2012-08-06 Thread Chris Hostetter
ApacheCon Europe will be happening 5-8 November 2012 in Sinsheim, Germany at the Rhein-Neckar-Arena. Early bird tickets go on sale this Monday, 6 August. http://www.apachecon.eu/ The Lucene/Solr track is shaping up to be quite impressive this year, so make your plans to attend an

Re: Directory flushing / commit / openIfChanged

2012-08-06 Thread Simon Willnauer
hey harald,

On Mon, Aug 6, 2012 at 1:22 PM, Harald Kirsch wrote:
> Hi,
>
> in my application I have to write tons of small documents to the index, but
> with a twist. Many of the documents are actually aggregations of pieces of
> information that appear in a data stream, usually close together, b

Re: questions about DocValues in 4.0 alpha

2012-08-06 Thread Simon Willnauer
hey,

On Mon, Aug 6, 2012 at 11:34 AM, Li Li wrote:
> hi everyone,
> in lucene 4.0 alpha, I found the DocValues are available and gave
> it a try. I am following the slides in
> http://www.slideshare.net/lucenerevolution/willnauer-simon-doc-values-column-stride-fields-in-lucene
> I have g

Directory flushing / commit / openIfChanged

2012-08-06 Thread Harald Kirsch
Hi, in my application I have to write tons of small documents to the index, but with a twist. Many of the documents are actually aggregations of pieces of information that appear in a data stream, usually close together, but nevertheless merged with information for other documents. When info

Re: questions about DocValues in 4.0 alpha

2012-08-06 Thread Uwe Schindler
You have to call getTopReaderContext on the directory reader and can loop easily over the leaves using leaves(). All docbases are then relative to the directory reader. If you get the top reader context from the atomic reader itself, it's only relative to itself, which does not help. getSequent
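
The doc base arithmetic described above can be sketched without any Lucene dependency. This is only a rough model, not the Lucene API: `Leaf`, `leaves`, `toGlobal` and `leafFor` are hypothetical names, and the segment sizes are made up. Each leaf gets a doc base equal to the number of documents in the leaves before it, so a leaf-local doc id plus the doc base gives the id relative to the top-level reader.

```java
import java.util.ArrayList;
import java.util.List;

public class DocBaseSketch {

    /** Hypothetical stand-in for one leaf (segment) under a top-level reader. */
    public static final class Leaf {
        public final int ord;      // position of the leaf in the top-level reader
        public final int docBase;  // first top-level doc id covered by this leaf
        public final int maxDoc;   // number of doc ids in this leaf

        Leaf(int ord, int docBase, int maxDoc) {
            this.ord = ord;
            this.docBase = docBase;
            this.maxDoc = maxDoc;
        }
    }

    /** Builds leaves whose doc bases are relative to the top-level reader. */
    public static List<Leaf> leaves(int[] segmentSizes) {
        List<Leaf> out = new ArrayList<>();
        int docBase = 0;
        for (int i = 0; i < segmentSizes.length; i++) {
            out.add(new Leaf(i, docBase, segmentSizes[i]));
            docBase += segmentSizes[i]; // next leaf starts after this one
        }
        return out;
    }

    /** Converts a leaf-local doc id to a top-level doc id. */
    public static int toGlobal(Leaf leaf, int localDoc) {
        return leaf.docBase + localDoc;
    }

    /** Finds the leaf that contains the given top-level doc id. */
    public static Leaf leafFor(List<Leaf> leaves, int globalDoc) {
        for (Leaf leaf : leaves) {
            if (globalDoc >= leaf.docBase && globalDoc < leaf.docBase + leaf.maxDoc) {
                return leaf;
            }
        }
        throw new IllegalArgumentException("doc id out of range: " + globalDoc);
    }

    public static void main(String[] args) {
        // Made-up segment sizes, purely for illustration.
        for (Leaf leaf : leaves(new int[] {3, 5, 2})) {
            System.out.println("leaf " + leaf.ord + " docBase=" + leaf.docBase
                    + " maxDoc=" + leaf.maxDoc);
        }
    }
}
```

If the leaves were instead built from a single segment on its own, that leaf's doc base would be 0 regardless of where the segment sits in the full index, which is why a context taken from an atomic reader does not help when top-level doc ids are needed.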

questions about DocValues in 4.0 alpha

2012-08-06 Thread Li Li
hi everyone, in lucene 4.0 alpha, I found the DocValues are available and gave it a try. I am following the slides in http://www.slideshare.net/lucenerevolution/willnauer-simon-doc-values-column-stride-fields-in-lucene I have got 2 questions. 1. is DocValues updatable now? 2. How