On 4 Nov 2005, at 05:23, Manoj Kr. Sheoran wrote:
Hi Erik and all,
Thanks for the concern. We will test this and let you know the results. We
would really appreciate it if any of you could tell us more about the
architecture of the Lucene search system. How do you manage the iterator,
and what is the method callback at query execution time (a broader view)?
There really isn't any method callback, at least not in the way I'm
thinking of it. When you search you get back Hits. Hits is an
efficient data structure that caches up to the next 200 documents and
re-searches to get successive documents when requested. A priority
queue is the key to keeping only the top scoring (or sorted) document
pointers around.
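
To illustrate the priority queue idea, here is a minimal sketch in plain Java. It is not Lucene's own code: the names TopDocsSketch, ScoredDoc and topN are hypothetical, and the scores are assumed to be already computed and indexed by document id. It only shows how a bounded min-heap keeps the top n document pointers without holding every match in memory.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.PriorityQueue;

// Illustrative only: hypothetical names, not Lucene classes.
final class TopDocsSketch {

    /** A document pointer (its id) paired with its score. */
    static final class ScoredDoc {
        final int docId;
        final float score;

        ScoredDoc(int docId, float score) {
            this.docId = docId;
            this.score = score;
        }
    }

    /** Returns the n highest-scoring docs, best first, retaining at most n entries. */
    static List<ScoredDoc> topN(float[] scores, int n) {
        Comparator<ScoredDoc> byScore =
            Comparator.comparingDouble((ScoredDoc d) -> d.score);
        // Min-heap: the weakest retained hit sits at the head, so it is cheap to evict.
        PriorityQueue<ScoredDoc> queue = new PriorityQueue<>(Math.max(1, n), byScore);

        for (int docId = 0; docId < scores.length; docId++) {
            if (queue.size() < n) {
                queue.add(new ScoredDoc(docId, scores[docId]));
            } else if (n > 0 && scores[docId] > queue.peek().score) {
                queue.poll();  // drop the current weakest hit
                queue.add(new ScoredDoc(docId, scores[docId]));
            }
        }

        // Drain into a list ordered from best to worst.
        List<ScoredDoc> result = new ArrayList<>(queue);
        result.sort(byScore.reversed());
        return result;
    }
}
```

Calling TopDocsSketch.topN(scores, 100) over a million scores never holds more than 100 entries at a time, which is the property described above. How Hits itself is implemented may differ in detail.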
Erik
Regards,
Manoj
----- Original Message -----
From: "Erik Hatcher" <[EMAIL PROTECTED]>
To: <java-user@lucene.apache.org>
Sent: Friday, November 04, 2005 3:41 PM
Subject: Re: Searching the contents
I certainly recommend testing this to see what kind of response times
you get for the first and successive searches after the caches are
built - be sure to use the same IndexReader for all searches to
benefit from caching :)
Sorting on 4-5 columns seems kind of extreme. Sorting uses up RAM
with the caches, so it would surely be much more efficient to
collapse the fields you need to sort by into a single field if
possible, and try to do so numerically rather than as Strings.
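
As a rough sketch of what collapsing several sort fields into one numeric value could look like, the plain Java below packs three non-negative values into a single long so that comparing the packed values gives the same order as comparing the fields one after another. The field names (category, day, rank) and the bit widths are assumptions made up for illustration. How the resulting value is indexed and sorted in Lucene depends on the version in use and is not shown here.

```java
// Illustrative only: hypothetical field names and bit layout.
final class CompositeSortKey {
    // Layout, most significant first: 15 bits category | 32 bits day | 16 bits rank.
    // 63 bits in total, so the packed value is never negative.
    private static final long MAX_CATEGORY = (1L << 15) - 1;
    private static final long MAX_DAY = (1L << 32) - 1;
    private static final long MAX_RANK = (1L << 16) - 1;

    /** Packs the three sort fields into one long that sorts like (category, day, rank). */
    static long pack(long category, long day, long rank) {
        check("category", category, MAX_CATEGORY);
        check("day", day, MAX_DAY);
        check("rank", rank, MAX_RANK);
        return (category << 48) | (day << 16) | rank;
    }

    private static void check(String name, long value, long max) {
        if (value < 0 || value > max) {
            throw new IllegalArgumentException(name + " out of range: " + value);
        }
    }
}
```

With this layout, a document with a lower category always sorts before one with a higher category, and day and rank only break ties. The trade-off is that the sort order is fixed at index time, so this only helps when the same multi-column order is used for every search.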
Erik
On 4 Nov 2005, at 02:35, Manoj Kr. Sheoran wrote:
Hi Chris,
Thanks for sharing your views with us. Does sorting work properly? Is there
any estimate of the time needed to sort on 4-5 columns with 1 million
records?
Regards,
Manoj
----- Original Message -----
From: "Chris Lu" <[EMAIL PROTECTED]>
To: <java-user@lucene.apache.org>
Sent: Friday, November 04, 2005 12:30 PM
Subject: Re: Searching the contents
One of my cases uses a PIII 450MHz with 256MB RAM and 1 million records.
The very first search takes around 3~4 seconds, and later searches are
sub-second, usually under 0.5 seconds.
Sorting on one column takes longer, around 7~8 seconds for the first
sort, and is sub-second once the cache is built.
Chris Lu
---------------------------
Full-Text Search on Any Database
http://www.dbsight.net
On 11/3/05, Manoj Kr. Sheoran <[EMAIL PROTECTED]> wrote:
Hi,
I have 100,000 documents in an index, but in the near future it will be
3 million and more. I am somewhat concerned about the searching process
with this many documents. I am sorting on some fields of the documents.
Could anybody tell me what to expect from the Lucene engine with
3 million results, sorted on 4 columns?
Thanks,
Manoj
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]