RE: IndexWriter.addIndexes

2006-03-21 Thread Frank Kunemann
Ok, thank you Otis! Frank -Original Message- From: Otis Gospodnetic [mailto:[EMAIL PROTECTED] Sent: Tuesday, March 21, 2006 4:44 PM To: java-user@lucene.apache.org Subject: Re: IndexWriter.addIndexes Hi, Yes, no IOException means all went well, I believe. Otis - Original Messag

lucene highlighter

2006-03-21 Thread Raghavendra Prabhu
Hi guys, can anyone tell me how to get the best fragments using the highlighter? The query has two terms, term1 and term2. The search result displays only term1 in the highlighter, whereas term2 is also there. How can I adjust the Lucene highlighter to make sure that at least each term is displaye

Re: Re-creating IndexSearcher after update

2006-03-21 Thread Yonik Seeley
On 3/20/06, Stephen Gray <[EMAIL PROTECTED]> wrote: > I want to re-create the IndexSearcher whenever the index changes, but I'm > assuming that if I do this bad things will happen to people who may be > currently running a search, or paging through a Hits collection created by > a previous search (

Re: Multi Search vs reader?

2006-03-21 Thread Yonik Seeley
On 3/21/06, Brian <[EMAIL PROTECTED]> wrote: > I had assumed since I had multiple indexes, > that meant I need to perform a MultiSearch. However > several people have mentioned MultiReader. What's the > difference, and what's the benefit of one over the > other?? A MultiSearcher can search over Se

Multi Search vs reader?

2006-03-21 Thread Brian
I've been doing some reading and I'm a bit confused, Could you help me clear this up. I have roughly 12 indexes available at any 1 time for a search. I had assumed since I had multiple indexes, that meant I need to perform a MultiSearch. However several people have mentioned MultiReader. What's th

Re: Read past EOF error in Windows

2006-03-21 Thread msftblows
I had a problem in the past with security on the folder where your index is located...but your error does not seem to show that ... I would check anyway though... -Original Message- From: Chris Cain <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Tue, 21 Mar 2006 15:33:26 +00

stemming question

2006-03-21 Thread msftblows
If I index with the Porter stemmer, what is the "overhead" that I incur (if any) when searching? The reason I ask is that I am planning on switching out Porter for KStemmer, and my boss is curious what the overhead would be in terms of CPU cycles (more or less). Granted the KStemmer is differe

Re: how to cluster documents

2006-03-21 Thread Valerio Schiavoni
Hi Grant, I think what is more relevant is what you wrote here: http://www.cnlp.org/apachecon2005/ about domain specialization, but it wasn't very (maybe because only 4 slides) On 3/21/06, Grant Ingersoll <[EMAIL PROTECTED]> wrote: > > You might want to look at the Carrot2 project > (http://www.c

Adding delta terms to the same lucene doc

2006-03-21 Thread prasenjitm
Hi folks, I have a requirement where I want to keep adding deltas (incremental terms) to a particular document in a lucene index, and get these additional terms indexed to the SAME document in the index. Is there an easy/smart way to do it without changing much of lucene ? Deleting and addi

Re: how to cluster documents

2006-03-21 Thread Grant Ingersoll
You might want to look at the Carrot2 project (http://www.carrot2.org/website/xml/index.xml). It does clustering and has support for Lucene. Valerio Schiavoni wrote: Hello, not sure if the term 'cluster' is the correct one, but here what i would like to do: given I have a small set of categori

Re: Term Vector Question

2006-03-21 Thread Grant Ingersoll
Daniel, Not sure I understand your problem, could you expand on it more, please. Daniel Cortes wrote: Hi everybody, How can I obtain the terms of a specific query (for example, all the terms of a specific group, field "ID_GROUP")? The only thing that I can think of now is to do a search and obtai

RE: Improving search performance

2006-03-21 Thread Satuluri, Venu_Madhav
You are right. I was unnecessarily transferring all the results from Hits object to an ArrayList. I don't know why it never struck me but this was the step that was taking a lot of time; it was staring at me all the time. Thanks, it's running much better now. Venu -Original Message- Fro
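The fix described above (not copying every hit into an ArrayList when only one page is shown) can be sketched in plain Java. This is a minimal illustration, not Lucene code: `PageFetcher`, `page`, and the `loader` callback are hypothetical names, and `loader` stands in for whatever lazily loads the document at a given hit position.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.IntFunction;

/** Hypothetical helper (not part of Lucene): materializes only one page of a lazy result list. */
final class PageFetcher {
    private PageFetcher() {}

    /**
     * Returns the items of one page, calling loader only for positions inside that page.
     * totalHits is the number of matches; pageIndex is zero-based.
     */
    static <T> List<T> page(int totalHits, int pageIndex, int pageSize, IntFunction<T> loader) {
        if (pageIndex < 0 || pageSize <= 0) {
            throw new IllegalArgumentException("pageIndex must be >= 0 and pageSize > 0");
        }
        int total = Math.max(0, totalHits);
        // Use long arithmetic so a large page index cannot overflow int.
        int start = (int) Math.min((long) pageIndex * pageSize, total);
        int end = (int) Math.min((long) start + pageSize, total);
        List<T> out = new ArrayList<>(end - start);
        for (int i = start; i < end; i++) {
            out.add(loader.apply(i)); // only the visible window is loaded
        }
        return out;
    }
}
```

With 20 results per page, a call such as `PageFetcher.page(totalHits, 2, 20, loader)` touches only 20 positions, however many documents matched.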

Re: IndexWriter.addIndexes

2006-03-21 Thread Otis Gospodnetic
Hi, Yes, no IOException means all went well, I believe. Otis - Original Message From: Frank Kunemann <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Tuesday, March 21, 2006 7:29:16 AM Subject: IndexWriter.addIndexes Hi, all I want to know about IndexWriter.addIndexes() is

Re: Improving search performance

2006-03-21 Thread Grant Ingersoll
I am not sure why you are getting all 60k docs at a time. If you use the Hits object, it caches the top 50 or so, but doesn't retrieve all the documents at once. Also, what are the size of your fields and how many fields do you have per document? Have you done any profiling to find the bott

Re: Grouping results by choosen field

2006-03-21 Thread Java Programmer
On 3/17/06, Java Programmer <[EMAIL PROTECTED]> wrote: > > Hello, > I tried to search myself for a solution, but without any good result, so I want > to ask the group. > My problem concerns result grouping, the best example will be Google search > where you have results sorted by relevance, and also gr

RE: Grouping results by choosen field

2006-03-21 Thread anton
Good grouping by domain realized in nutch... Nutch can serve good example of group on certain field. -Original Message- From: Java Programmer [mailto:[EMAIL PROTECTED] Sent: Tuesday, March 21, 2006 3:56 PM To: java-user@lucene.apache.org Subject: Re: Grouping results by choosen field On

Re: wildcard support

2006-03-21 Thread Erik Hatcher
If you've rewritten a WildcardQuery and get an empty query, that means that no terms matched the wildcard expression in the index pointed to by the IndexReader you provided. Erik On Mar 21, 2006, at 5:08 AM, Raghavendra Prabhu wrote: Hi I am using the highlightertest.java to extr
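To illustrate the point above, here is a small plain-Java sketch of what wildcard expansion amounts to: the pattern is matched against the terms that exist in the index, and an empty result means nothing matched. It is not Lucene's implementation; `WildcardExpansion`, `toPattern`, and `expand` are hypothetical names, and the term set stands in for the index's term dictionary. It assumes `*` matches zero or more characters and `?` matches exactly one.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.SortedSet;
import java.util.regex.Pattern;

/** Hypothetical illustration (not Lucene code) of expanding a wildcard against indexed terms. */
final class WildcardExpansion {
    private WildcardExpansion() {}

    /** Translates '*' and '?' into a regex; every other character is matched literally. */
    static Pattern toPattern(String wildcard) {
        StringBuilder sb = new StringBuilder();
        for (char c : wildcard.toCharArray()) {
            if (c == '*') {
                sb.append(".*");
            } else if (c == '?') {
                sb.append('.');
            } else {
                sb.append(Pattern.quote(String.valueOf(c)));
            }
        }
        return Pattern.compile(sb.toString(), Pattern.DOTALL);
    }

    /** Returns the indexed terms that match the wildcard, in the set's order. */
    static List<String> expand(String wildcard, SortedSet<String> indexedTerms) {
        Pattern p = toPattern(wildcard);
        List<String> matches = new ArrayList<>();
        for (String term : indexedTerms) {
            if (p.matcher(term).matches()) {
                matches.add(term);
            }
        }
        return matches; // empty when no indexed term matches the expression
    }
}
```

If the expansion comes back empty, the first thing to check is whether the terms were indexed in the form the pattern expects (for example, lowercased by the analyzer).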

Re: how to cluster documents

2006-03-21 Thread jason
I guess you should use some text mining tools. You can use Google to find them. I remember UIUC recently released one tool. It is very good. On 3/21/06, Valerio Schiavoni <[EMAIL PROTECTED]> wrote: > > Hello, > not sure if the term 'cluster' is the correct one, but here what i would > like to do: > gi

Improving search performance

2006-03-21 Thread Satuluri, Venu_Madhav
Hi, I am looking for ways to improve the performance of lucene search in our app. Lucene performance is visibly slow when there are a lot of documents to be returned (performance almost seems directly proportional to the number of documents returned by Searcher). However, we show 20 results per pa

IndexWriter.addIndexes

2006-03-21 Thread Frank Kunemann
Hi, all I want to know about IndexWriter.addIndexes() is if there is a way to tell if merging was successful or not. Or can I be sure that if no IOException is thrown, everything is fine and I can delete the old indexes? Regards Frank

Read past EOF error in Windows

2006-03-21 Thread Chris Cain
Hi all, I wrote a Lucene program which runs fine under Linux and Mac but fails on most Windows machines. (I have managed to get it to work on one version of XP however.) Specifically, when I open or search the index I get the following error message. Any help would be appreciated, Cheers, Chri

how to cluster documents

2006-03-21 Thread Valerio Schiavoni
Hello, not sure if the term 'cluster' is the correct one, but here what i would like to do: given I have a small set of categories; i manually defined some keywords for each category. ie: -spielberg: ET, munich, indiana jones; -sport: football, basket, volley, etc etc; then, i have a quite large

RE: Re-creating IndexSearcher after update

2006-03-21 Thread Vanlerberghe, Luc
Yep, I created DelayCloseIndexSearcher just for this scenario and it has been running in production for about half a year now... There's a usage example in the javadoc, but it can be optimised even more (without touching the code that does the searches, handles the hits, etc...). In my production envi
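The general idea behind delaying a close until in-flight searches finish can be sketched with reference counting. This is a plain-Java illustration under stated assumptions, not the DelayCloseIndexSearcher code from LUCENE-445: `RefCountedResource`, `acquire`, and `release` are hypothetical names, and the wrapped `Closeable` stands in for a searcher.

```java
import java.io.Closeable;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical holder (not Lucene, not DelayCloseIndexSearcher): closes only after the last user is done. */
final class RefCountedResource<T extends Closeable> {
    private final T resource;
    // Starts at 1: the owner's reference, dropped when a newer resource replaces this one.
    private final AtomicInteger refs = new AtomicInteger(1);

    RefCountedResource(T resource) {
        this.resource = resource;
    }

    /** Called before a search starts; returns null if the resource has already been closed. */
    T acquire() {
        while (true) {
            int n = refs.get();
            if (n == 0) {
                return null; // already closed, caller must fetch the newer holder
            }
            if (refs.compareAndSet(n, n + 1)) {
                return resource;
            }
        }
    }

    /** Called once per acquire(), and once by the owner on replacement; the last call closes. */
    void release() {
        if (refs.decrementAndGet() == 0) {
            try {
                resource.close();
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        }
    }
}
```

When the index changes, the owner would create a new holder for the new searcher and call `release()` on the old one; the old searcher then stays open until every search that acquired it has released it.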

wildcard support

2006-03-21 Thread Raghavendra Prabhu
Hi I am using the highlightertest.java to extract the wild card terms. I use the queryParser to parse my queryString Then i store the text in a RAM directory ( which i want to scan) and then rewrite it as mentioned in the highlighter example query=query.rewrite(reader) Now if i print the quer

Please tell me some of the Apache Search Engines

2006-03-21 Thread Babu, KameshNarayana \(GE, Research, consultant\)
Hi all, Can anybody tell me some of the open source projects for searching the net? My exact requirement is, if I specify the URL and keyword. Based on the given keyword the application should search on the particular URL and give me

Re: restart interrupted index

2006-03-21 Thread Rob Young
Paulo Silveira wrote: Chris, I really would like only this extra files, but I have the same problem here. If I interrupt my IndexWriter with a kill signal, most of the time I will be left with a lock file AND corrupted index files (the searcher will throw some IllegalStateExceptions after the

RE: Re-creating IndexSearcher after update

2006-03-21 Thread Koji Sekiguchi
Hi Steve, DelayCloseIndexSearcher may suit your requirement? Please check: http://issues.apache.org/jira/browse/LUCENE-445 Hope this helps. Koji