RE: search hits not returned until I stop and restart application

2010-06-21 Thread Steven A Rowe
Andy, I think batching commits either by time or number of documents is common. Do you know about NRT (Near Realtime Search)? Using IndexWriter.getReader(), you can avoid commits altogether, as well as reduce update->search latency.
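
A minimal sketch of the batching idea mentioned above, committing after either a number of documents or an elapsed time. CommitBatcher, its parameters, and the injected clock and commit action are hypothetical names for illustration, not part of Lucene; in a real application the commit action would presumably wrap a call to IndexWriter.commit().

```java
import java.util.function.LongSupplier;

/** Hypothetical helper (not a Lucene class): decides when pending documents should be committed. */
final class CommitBatcher {
    private final int maxDocs;            // commit once this many documents are pending
    private final long maxAgeMillis;      // or once the oldest pending document is this old
    private final LongSupplier clock;     // injected time source, in milliseconds
    private final Runnable commitAction;  // assumption: wraps the real commit call
    private int pendingDocs = 0;
    private long firstPendingAt = 0L;

    CommitBatcher(int maxDocs, long maxAgeMillis, LongSupplier clock, Runnable commitAction) {
        this.maxDocs = maxDocs;
        this.maxAgeMillis = maxAgeMillis;
        this.clock = clock;
        this.commitAction = commitAction;
    }

    /** Call after each document is added; returns true if this call triggered a commit. */
    boolean documentAdded() {
        long now = clock.getAsLong();
        if (pendingDocs == 0) {
            firstPendingAt = now; // start timing the new batch
        }
        pendingDocs++;
        if (pendingDocs >= maxDocs || now - firstPendingAt >= maxAgeMillis) {
            flush();
            return true;
        }
        return false;
    }

    /** Commits any pending documents; does nothing if the batch is empty. */
    void flush() {
        if (pendingDocs > 0) {
            commitAction.run();
            pendingDocs = 0;
        }
    }

    int pendingDocs() {
        return pendingDocs;
    }
}
```

Note that in this sketch the time threshold is only checked when a document is added, so a final flush() is still needed after ingestion ends.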

RE: search hits not returned until I stop and restart application

2010-06-21 Thread andynuss
"Maybe you aren't using the IndexReader instance returned by reopen(), but instead are continuing to use the instance on which you called reopen()? It's tough to figure this kind of thing out without looking at the code." That was it, I was not using the newly (re)opened index. By the way, one

RE: search hits not returned until I stop and restart application

2010-06-21 Thread Steven A Rowe
Andy, it sounds like you're doing the right thing. Maybe you aren't using the IndexReader instance returned by reopen(), but instead are continuing to use the instance on which you called reopen()? It's tough to figure this kind of thing out without looking at the code. For example, what do yo
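
The pitfall described above can be sketched with a toy model. ToyIndex and ToyReader are hypothetical stand-ins, not Lucene classes; the only behaviour assumed is the one the thread describes: a reader is a point-in-time view, and reopen() hands back a different instance when the index has changed, so the caller has to switch to the returned object.

```java
import java.util.ArrayList;
import java.util.List;

/** Toy append-only index; a stand-in for illustration, NOT Lucene's API. */
final class ToyIndex {
    private final List<String> committed = new ArrayList<>();
    private final List<String> pending = new ArrayList<>();

    void add(String doc) { pending.add(doc); }

    /** Makes pending documents visible to readers opened (or reopened) afterwards. */
    void commit() {
        committed.addAll(pending);
        pending.clear();
    }

    /** Opens a reader over a copy of the currently committed documents. */
    ToyReader open() { return new ToyReader(this, new ArrayList<>(committed)); }

    int committedCount() { return committed.size(); }
}

/** Point-in-time view: sees only the documents committed when it was opened. */
final class ToyReader {
    private final ToyIndex index;
    private final List<String> snapshot;

    ToyReader(ToyIndex index, List<String> snapshot) {
        this.index = index;
        this.snapshot = snapshot;
    }

    /** Returns a NEW reader if the index changed since this one was opened, else this instance. */
    ToyReader reopen() {
        return index.committedCount() == snapshot.size() ? this : index.open();
    }

    boolean contains(String doc) { return snapshot.contains(doc); }
}
```

With these classes, calling reader.reopen() after a commit and discarding the result leaves reader stale; the fix is to assign the result (for example reader = reader.reopen()) and build any searcher from the new instance.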

RE: search hits not returned until I stop and restart application

2010-06-21 Thread andynuss
"So you gotta call commit() or close(). Once you've done that, you can reduce the (expensive) cost of opening a new IndexReader by calling reopen()." Steve, I tried this, and I must have done something wrong. After my document set was ingested, I called a function which (1) called the IndexWr

RE: search hits not returned until I stop and restart application

2010-06-21 Thread Steven A Rowe
Hi Andy, From the API docs for IndexWriter : [D]ocuments are added with addDocument and removed with deleteDocuments(Term) or deleteDocuments(Query). A document can be updated with updat

search hits not returned until I stop and restart application

2010-06-21 Thread andynuss
Hi, I have an IndexWriter singleton in my program, and an IndexSearcher singleton based on a readonly IndexReader singleton. When I use the IndexWriter to index a large document to lucene, and then, while the program is still running, use my previously created IndexSearcher to find hits in that

Re: Question from a new user : IndexSearcher.doc

2010-06-21 Thread Erick Erickson
They're quite different beasts to use. SOLR will have you up and running with some configuration very quickly, and if you're comfortable with servlet containers, it'll be even faster. It has a DIH handler which will index data from a database (again, with some configuration, but not necessarily pro

Re: segment_N file is missed

2010-06-21 Thread maryam ma'danipour
That's great. I'll try it. Thanks. On Sat, Jun 19, 2010 at 11:10 AM, Lance Norskog wrote: > This code is old (2006!) and I've updated it for Lucene 2.9.2 and the trunk. This version only works for one CFS file (that I've tested). The code does not check versions carefully. Here are both versi

Re: Strange behaviour of StandardTokenizer

2010-06-21 Thread Anna Hunecke
Hi! Basically, what I want is something that removes punctuation. But I realized now that things like email or number recognition are also very useful if I want to give suggestions. I want to be able to give 'nl-lt001' as a suggestion when the user enters 'nl'. This would of course not be possi

Re: Question from a new user : IndexSearcher.doc

2010-06-21 Thread Victor Kabdebon
Hi Erick, Thank you very much for your explanations. 588 is a rather long way to go, so you're right, maybe I won't need to care about that problem at the moment. To answer your final question: no, indeed I won't need to store a lot of data. Just some keys in order to find the data in Cassandra lat