Index design question

2005-08-05 Thread N. C. Deepak Ramesh
Hi list, Let me describe my issue using a simpler model. Let's say I were to build a blog which allows each post to have multiple keywords. I want to provide a search over the posts but restricted to a subset of the keywords (say, python, windows, etc.). How can I structure the index in this c

Re: Lock Issue

2005-08-05 Thread Volodymyr Bychkoviak
The index is locked while IndexReader is opening it. After that (once the index structure has been read), the lock is released. Aigner, Thomas wrote: Howdy all, I am running some tests against our search. I have created a script that will run 30 simultaneous executions of 25 searches each. The 25 sea

Lock Issue

2005-08-05 Thread Aigner, Thomas
Howdy all, I am running some tests against our search. I have created a script that will run 30 simultaneous executions of 25 searches each. The 25 searches are randomized but the same data is used. I am consistently getting "Lock obtain timed out: Lock@/tmp/lucene-77053e9cdda0a530f729e7

RE: Split Search Word

2005-08-05 Thread Mordo, Aviran (EXP N-NANNATEK)
The StandardAnalyzer should work just fine with it. It will break the search string into 5 search terms. HTH Aviran http://www.aviransplace.com _ From: Karthik N S [mailto:[EMAIL PROTECTED] Sent: Friday, August 05, 2005 1:57 AM To: LUCENE Subject: Split Search Word Hi Luceners Apol

RE: Search shortly after adding a doc

2005-08-05 Thread Mordo, Aviran (EXP N-NANNATEK)
You can try working with two indexes: one for all of today's messages, which will be pretty small, and another for past messages. Then, once a day, merge the small index into the big one and start fresh. This way you only need to open an IndexReader for the small index, while the big one does not change.
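
The two-index idea above can be sketched in plain Java. This is only an in-memory illustration under stated assumptions, not Lucene code: the class `TwoTierIndex` and its methods `add`, `search`, `mergeDaily`, and `smallTermCount` are hypothetical names invented for this example, and the "indexes" are simple term-to-document maps.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical in-memory stand-in for the two-index scheme; not the Lucene API.
class TwoTierIndex {
    // Each map goes from a term to the ids of the documents containing it.
    private final Map<String, Set<Integer>> small = new HashMap<>(); // today's messages
    private final Map<String, Set<Integer>> big = new HashMap<>();   // past messages

    // New documents only ever touch the small index.
    void add(int docId, String text) {
        for (String term : text.toLowerCase().split("\\s+")) {
            if (!term.isEmpty()) {
                small.computeIfAbsent(term, k -> new TreeSet<>()).add(docId);
            }
        }
    }

    // A search consults both indexes and unions the hits.
    Set<Integer> search(String term) {
        String key = term.toLowerCase();
        Set<Integer> hits = new TreeSet<>();
        hits.addAll(big.getOrDefault(key, Collections.<Integer>emptySet()));
        hits.addAll(small.getOrDefault(key, Collections.<Integer>emptySet()));
        return hits;
    }

    // Once a day: fold the small index into the big one and start fresh.
    void mergeDaily() {
        for (Map.Entry<String, Set<Integer>> e : small.entrySet()) {
            big.computeIfAbsent(e.getKey(), k -> new TreeSet<>()).addAll(e.getValue());
        }
        small.clear();
    }

    // Number of distinct terms currently held in the small index.
    int smallTermCount() {
        return small.size();
    }
}
```

In this sketch only the small map changes between merges, which mirrors the point of the suggestion: whatever has to be refreshed after each added document stays small.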

Re: de pluralization

2005-08-05 Thread Andrew Boyd
You might want to look at stemming for "de pluralization"; it boils words down to their "root", so bombs and bombing get stemmed to bomb. I'm using the Snowball stemmer, which handles different languages as well as English. It is in the sandbox. org.apache.lucene.analysis.snowball.SnowballFilt

Re: de pluralization

2005-08-05 Thread Dan Armbrust
Mufaddal Khumri wrote: Are there analyzers that do this already? It's not an analyzer, but the "norm" feature of this tool does a good job at getting to the normalized form of the words... http://umlslex.nlm.nih.gov/lvg/current/ http://umlslex.nlm.nih.gov/lvg/current/docs/userDoc/norm.htm

SV: SV: Search in multi fields with cross field AND ?

2005-08-05 Thread erlend.bjorge
Hi Martin! It's starting to work now, thanks!! :-) >In my case "text" is everything. Whenever I add anything to the index, I >also add a "text" Field. So in your case, I would build the index like >you did with an additional >doc.add(Field.Text("text", bean.getCompanyName())); I see, but I do

Re: setBoost(float) in org.apache.lucene.document.Field cannot be applied to (double)???

2005-08-05 Thread Riccardo Daviddi
It seems it doesn't change the boost factors, and that is strange. I did as in the manual; I thought it was easy. I just use: fieldname.setBoost(1.2f); and then I add that field to the doc, and I index the doc. The whole indexing process goes OK, only these boost factors are different from what I do
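
The compile error in this thread's subject line comes from Java's literal typing: `1.2` is a `double`, and Java does not narrow a `double` to a `float` parameter implicitly. A minimal sketch follows, assuming a hypothetical `BoostedField` class that stands in for a field with a `setBoost(float)` method (it is not Lucene's `Field`); `BoostDemo` is likewise an invented name. It covers only the literal issue, not the differing boost values described in the message.

```java
// Hypothetical stand-in for a field with a float boost; not Lucene's Field class.
class BoostedField {
    private float boost = 1.0f;

    void setBoost(float boost) {
        this.boost = boost;
    }

    float getBoost() {
        return boost;
    }
}

class BoostDemo {
    static float demo() {
        BoostedField field = new BoostedField();
        // field.setBoost(1.2);      // does not compile: 1.2 is a double literal
        field.setBoost(1.2f);        // float literal, accepted
        field.setBoost((float) 1.2); // explicit narrowing cast, also accepted
        return field.getBoost();
    }
}
```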