21 sep 2007 kl. 09.09 skrev Jarvis:
Storing data in a document will not affect search speed.
This is helpful .
Someone should probably confirm that though.
And another question :)
When I make a search which will return 500000 results , it will be
very
inefficient when I want to get the document between the No.450000 to
No.450010 or some back document . Why was it ? Or some solution ?
I suppose you are referring to the class Hits? It should only be an
extra cost if you iterate a lot of documents priot to index 450000,
as that will force it to replace the query now and then.
It is a pretty simple peice of code. Go right ahead and take a look
at it:
<http://svn.apache.org/repos/asf/lucene/java/trunk/src/java/org/
apache/lucene/search/Hits.java>
--
karl
Thanks,
Jarvis .
-----Original Message-----
From: Karl Wettin [mailto:[EMAIL PROTECTED]
Sent: Friday, September 21, 2007 2:45 PM
To: java-user@lucene.apache.org
Subject: Re: About the search efficiency based on document's length
21 sep 2007 kl. 08.23 skrev Jarvis:
There is a question about the document’s length and search
efficiency.
Two ways to index some html pages(ignore some information): one is
both
store and index the html content in lucene dictionary, the other is
just
index the content . For the first method is there a efficiency
problem
compare to the second besides the folder size increase?
Not sure I understand your question, but I'll give it a go.
As far as I know, storing data in a document will not affect search
speed. However, loading large amounts of data to a Document will of
course consume resources. Therefor it is possible to pass a
FieldSelector to the IndexReader when you retrieve a Document,
allowing you to define what fields to ignore, load, lazy load, et c.
I hope this helps.
--
karl
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]