On 4 Nov 2005, at 05:23, Manoj Kr. Sheoran wrote:
Hi Erik and all,
Thanks for the concern. We will test and let you know it also. We would really appreciate if any of you can tell some more about the architecture of Lucene searching system. How do you manages the iterator and what is the
method callback at query execution time(a broader view).

There really isn't any method callback, at least not in the way I'm thinking of it. When you search you get back Hits. Hits is an efficient data structure that caches up to the next 200 documents, re- searches to get successive documents when requested. A priority queue is the key to keeping only the top scoring (or sorted) document pointers around.

    Erik



Regards,
Manoj

----- Original Message -----
From: "Erik Hatcher" <[EMAIL PROTECTED]>
To: <java-user@lucene.apache.org>
Sent: Friday, November 04, 2005 3:41 PM
Subject: Re: Searching the contents



I certainly recommend testing this to see what kind of response times
you get for the first and successive searches after the caches are
built - be sure to use the same IndexReader for all searches to
benefit from caching :)

Sorting on 4-5 columns seems kind of extreme.  Sorting uses up RAM
with the caches, so it would surely be much more efficient to
collapse the fields you need to sort by into a single field if
possible, and try to do so numerically rather than as Strings.

     Erik



On 4 Nov 2005, at 02:35, Manoj Kr. Sheoran wrote:


Hi Chris,
Thanks for sharing your views with us. Is  sorting works properly.
Is there
any idea regarding time estimation for sort on 4-5 columns with 1
million
records?

Regards,
Manoj
----- Original Message -----
From: "Chris Lu" <[EMAIL PROTECTED]>
To: <java-user@lucene.apache.org>
Sent: Friday, November 04, 2005 12:30 PM
Subject: Re: Searching the contents




One of my case is using PIII 450MHz 256M RAM, with 1million records,
the search is around 3~4 seconds for the very first search, and
sub-second, usually under 0.5 second.

Sorting on one column will be longer, like 7~8 seconds for the first
sort. And subseconds with cache.

Chris Lu
---------------------------
Full-Text Search on Any Database
http://www.dbsight.net

On 11/3/05, Manoj Kr. Sheoran <[EMAIL PROTECTED]> wrote:


Hi,
I am having 1,00,000 documents in a index but in near future it
will be


3 million and more.  I am somewhat concerned about  the searhing
process
with this much number of document.  I am giving order on some
fields of
documents. Could anybody tell be the expected result from lucene
engine with
3 million results and order/sorted on 4 column.



Thanks,
Manoj





------------------------------------------------------------------- --
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]





-------------------------------------------------------------------- -
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]




---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to