Re: Searching documents on big index by using ParallelMultiSearcher is slow...

2006-10-04 Thread Simon Wistow
On Wed, Oct 04, 2006 at 01:55:06PM +, eks dev said: > have you considered hadoop "light" mesagging RPC, should have > significantly smaller latencies than RMI Yes, it's one of the things I'm looking at. - To unsubscribe, e-

Re: Searching documents on big index by using ParallelMultiSearcher is slow...

2006-10-04 Thread eks dev
have you considered hadoop "light" mesagging RPC, should have significantly smaller latencies than RMI - Original Message From: Simon Wistow <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Wednesday, 4 October, 2006 3:26:38 PM Subject: Re: Searching documents

Re: Searching documents on big index by using ParallelMultiSearcher is slow...

2006-10-04 Thread Scott
> Prelimary experimentation with a RemoteSearch/ParallelMultiSearcher > combination found that there were issues with the RMI causing > significant blocking. > > I'm currently playing around with trying alternative messaging > approaches so that I can also load balance requests. Wow, it is very i

Re: Searching documents on big index by using ParallelMultiSearcher is slow...

2006-10-04 Thread Simon Wistow
On Wed, Oct 04, 2006 at 08:14:38AM -0400, Haines, Ronald C. (LNG-DAY) said: > I too am interested in learning more about a large scale distributed > Lucene model. I'm also building a large scale (billions of documents) Lucene index. Prelimary experimentation with a RemoteSearch/ParallelMultiSear

Re: Searching documents on big index by using ParallelMultiSearcher is slow...

2006-10-04 Thread Scott
to:[EMAIL PROTECTED] Sent: Wednesday, October 04, 2006 7:33 AM To: java-user@lucene.apache.org Subject: Re: Searching documents on big index by using ParallelMultiSearcher is slow... OK, you're now officially beyond my competence, so I'll have to wait for people who actually know

Re: Searching documents on big index by using ParallelMultiSearcher is slow...

2006-10-04 Thread Scott
My index increases periodically. Now 1 sec for 10G indexes. I am worried that futurely, how about response time for 20G, 30G,,, and 50G indexes? I'll try remote Hits (result set) object and the SearchMaster merges top N of them. Thank you. Erick Erickson wrote: OK, you're now officially bey

RE: Searching documents on big index by using ParallelMultiSearcher is slow...

2006-10-04 Thread Haines, Ronald C. \(LNG-DAY\)
he overall response time. I too am interested in learning more about a large scale distributed Lucene model. -Original Message- From: Erick Erickson [mailto:[EMAIL PROTECTED] Sent: Wednesday, October 04, 2006 7:33 AM To: java-user@lucene.apache.org Subject: Re: Searching documents on big inde

Re: Searching documents on big index by using ParallelMultiSearcher is slow...

2006-10-04 Thread Erick Erickson
OK, you're now officially beyond my competence, so I'll have to wait for people who actually know Although if I read your stats right, you're getting approximately 1 sec response time over 10M documents on a 10G index. That's not bad at all. What kind of response time do you need? On 10/3/0

Re: Searching documents on big index by using ParallelMultiSearcher is slow...

2006-10-03 Thread Scott
Hi, > Well, the first question is always "are you opening/closing your > IndexSearchers for each request on your remote machines?". This is always a > no-no. This is also a question for your single-searcher version. Yes I know, each search slave (RMI server) have single instance of IndexSearc

Re: Searching documents on big index by using ParallelMultiSearcher is slow...

2006-10-03 Thread Erick Erickson
Well, the first question is always "are you opening/closing your IndexSearchers for each request on your remote machines?". This is always a no-no. This is also a question for your single-searcher version. What is your performance if you only go to one server? I'd start by finding out what happen

Searching documents on big index by using ParallelMultiSearcher is slow...

2006-10-02 Thread Scott
Hi, I have a question about ParallelMultiSearcher performance. I want to search documents on about 10 gigabytes of index. (The index has 10,000,000 documents.) I get very slow performance using IndexSearcher with ONE index normally. Then I tried to use ParallelMultiSearcher with 10 servers of re