Re: Long Query Performance

2007-01-24 Thread mark harwood
ssage
From: Somnath Banerjee <[EMAIL PROTECTED]>
To: java-user@lucene.apache.org
Sent: Wednesday, 24 January, 2007 5:02:49 AM
Subject: Re: Long Query Performance
Here is the code. Let me know if you need any clarification.
// MaxConcepts is set to 100
long stTime = System.currentTimeMi

Re: Long Query Performance

2007-01-23 Thread Somnath Banerjee
Here is the code. Let me know if you need any clarification.
// MaxConcepts is set to 100
long stTime = System.currentTimeMillis();
// bq is the Boolean query constructed out of the title of the query document
TopDocs docs = searcher.search(bq, null, MaxConcepts);
// Store the title of the resu

Re: Long Query Performance

2007-01-23 Thread Chris Hostetter
: Chris, I was measuring the query time only. I have increased the heap size
that still doesn't tell us what you are doing -- "query time" can mean a lot of things ... are you using the Hits class? are you iterating over results? are you pulling out stored fields? are you sorting? are you using
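
One way to answer the question above is to time each phase separately instead of reporting a single number. The following is only a sketch: `PhaseTimer` and the phase names are hypothetical, and the two lambdas in `main` are stand-ins for the real search call and the real stored-field loading, which are not shown in this thread.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.function.Supplier;

/** Records wall-clock time per named phase so "query time" can be broken down. */
public class PhaseTimer {
    private final Map<String, Long> nanosByPhase = new LinkedHashMap<>();

    /** Runs the work, adds its elapsed time to the named phase, and returns its result. */
    public <T> T time(String phase, Supplier<T> work) {
        long start = System.nanoTime();
        try {
            return work.get();
        } finally {
            nanosByPhase.merge(phase, System.nanoTime() - start, Long::sum);
        }
    }

    /** Accumulated nanoseconds per phase, in the order phases were first seen. */
    public Map<String, Long> nanosByPhase() {
        return nanosByPhase;
    }

    public static void main(String[] args) {
        PhaseTimer timer = new PhaseTimer();

        // Stand-in for the search call itself (assumption: returns matching doc ids).
        int[] docIds = timer.time("search", () -> new int[] {3, 1, 2});

        // Stand-in for pulling stored fields out of each hit.
        String[] titles = timer.time("load stored fields", () -> {
            String[] out = new String[docIds.length];
            for (int i = 0; i < docIds.length; i++) {
                out[i] = "title-" + docIds[i];
            }
            return out;
        });

        System.out.println("loaded " + titles.length + " titles");
        timer.nanosByPhase().forEach((phase, nanos) ->
            System.out.println(phase + ": " + nanos / 1_000_000.0 + " ms"));
    }
}
```

Reporting the phases side by side makes it clear whether the time goes into matching and scoring or into what is done with the results afterwards.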

Re: Long Query Performance

2007-01-23 Thread Somnath Banerjee
Thanks for all the replies. I'll try the methods you suggested and will post the results of my experiment. Chris, I was measuring the query time only. I have increased the Java heap size to 1 GB. Now a 5-8 word query takes about 0.1-0.4 seconds. That's reasonable, I guess. Thanks, Somnath
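
The message does not say how the heap was raised. A common way is the JVM's `-Xmx` option (for example `java -Xmx1g ...`), and that is an assumption here, not something stated in the thread. A small check like the one below, with the hypothetical class name `HeapCheck`, can confirm that the setting actually reached the JVM running the search.

```java
/** Prints the maximum heap size the running JVM will attempt to use. */
public class HeapCheck {

    /** Maximum heap in mebibytes, as reported by the runtime. */
    public static long maxHeapMiB() {
        return Runtime.getRuntime().maxMemory() / (1024L * 1024L);
    }

    public static void main(String[] args) {
        // Assumed launch command: java -Xmx1g HeapCheck
        System.out.println("max heap: " + maxHeapMiB() + " MiB");
    }
}
```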

Re: Long Query Performance

2007-01-22 Thread Chris Hostetter
: I tried by using only the title of (5-8 words) the query documents instead
: of full text of the document. But that is also taking 0.5-1 sec for each
i can't speak to your 200-300 word BooleanQuery timings, but 0.5-1 seconds for an 8 clause BooleanQuery is very very wrong. i suspect there's so

Re: Long Query Performance

2007-01-22 Thread Paul Elschot
anerjee <[EMAIL PROTECTED]>
> To: java-user@lucene.apache.org
> Sent: Monday, 22 January, 2007 1:28:20 PM
> Subject: Re: Long Query Performance
>
> Thanks for the reply. Good guess I think.
>
> DB (Index) is basically a collection of encyclopedia documents. Queries are
>

Re: Long Query Performance

2007-01-22 Thread mark harwood
ath Banerjee <[EMAIL PROTECTED]>
To: java-user@lucene.apache.org
Sent: Monday, 22 January, 2007 1:28:20 PM
Subject: Re: Long Query Performance
Thanks for the reply. Good guess I think. DB (Index) is basically a collection of encyclopedia documents. Queries are also a collection of documents but of v

Re: Long Query Performance

2007-01-22 Thread Michael D. Curtin
Somnath Banerjee wrote:
Thanks for the reply. Good guess I think. DB (Index) is basically a collection of encyclopedia documents. Queries are also a collection of documents but of various domains. My task is to find, for each "query document", the top 100 matching encyclopedia contents. I tried b

Re: Long Query Performance

2007-01-22 Thread Somnath Banerjee
Thanks for the reply. Good guess I think. DB (Index) is basically a collection of encyclopedia documents. Queries are also a collection of documents but of various domains. My task is to find, for each "query document", the top 100 matching encyclopedia contents. I tried by using only the title of

Re: Long Query Performance

2007-01-22 Thread Michael D. Curtin
Somnath Banerjee wrote:
I have created an 8GB index of almost 2 million documents. My requirement is to run nearly 0.72 million queries on this index. Each query consists of 200 - 400 words. I have created a Boolean Query by ORing these words. But each query is taking nearly 5 - 10 secon

Long Query Performance

2007-01-22 Thread Somnath Banerjee
Hi All, I have created an 8GB index of almost 2 million documents. My requirement is to run nearly 0.72 million queries on this index. Each query consists of 200 - 400 words. I have created a Boolean Query by ORing these words. But each query is taking nearly 5 - 10 seconds to execute ( 2
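
To give a rough feel for why ORing hundreds of words is expensive, here is a toy model and not Lucene's actual implementation. It assumes a plain in-memory inverted index (the hypothetical class `OrQueryCost`) and counts how many posting entries a disjunction has to touch. In this model the work grows with the combined posting-list length of all query terms, so a single very common word can dominate the cost.

```java
import java.util.Arrays;
import java.util.Collection;
import java.util.HashMap;
import java.util.Map;

/** Toy inverted index that counts the postings an OR query must visit. */
public class OrQueryCost {
    // term -> ids of the documents that contain it
    private final Map<String, int[]> postings = new HashMap<>();

    public void add(String term, int... docIds) {
        postings.put(term, docIds);
    }

    /**
     * Evaluates an OR over the query terms. Each document's score is the number
     * of query terms it contains. Returns how many posting entries were visited.
     */
    public long postingsVisited(Collection<String> queryTerms, Map<Integer, Integer> scoresOut) {
        long visited = 0;
        for (String term : queryTerms) {
            int[] docs = postings.getOrDefault(term, new int[0]);
            for (int doc : docs) {
                scoresOut.merge(doc, 1, Integer::sum);
                visited++;
            }
        }
        return visited;
    }

    public static void main(String[] args) {
        OrQueryCost index = new OrQueryCost();

        int[] everyDoc = new int[1000];
        for (int i = 0; i < everyDoc.length; i++) {
            everyDoc[i] = i;
        }
        index.add("the", everyDoc);        // very common term: 1000 postings
        index.add("lucene", 7, 42, 512);   // rare term: 3 postings

        Map<Integer, Integer> scores = new HashMap<>();
        System.out.println("rare term only: "
            + index.postingsVisited(Arrays.asList("lucene"), scores));

        scores.clear();
        System.out.println("rare OR common: "
            + index.postingsVisited(Arrays.asList("the", "lucene"), scores));
    }
}
```

In this model, dropping very common words from a long query removes most of the visited postings while changing the ranking of the best matches comparatively little. Whether that carries over to the index described above would need to be measured.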