Re: About the search efficiency based on document's length

Karl Wettin Thu, 20 Sep 2007 23:52:47 -0700

On 21 Sep 2007, at 08:23, Jarvis wrote:

There is a question about the document’s length and search efficiency.

There are two ways to index some HTML pages (ignoring some information): one is to both store and index the HTML content in the Lucene dictionary, the other is to just
index the content. Does the first method have an efficiency problem
compared to the second, besides the increase in folder size?


Not sure I understand your question, but I'll give it a go.

As far as I know, storing data in a document will not affect search speed. However, loading large amounts of data into a Document will of course consume resources. Therefore it is possible to pass a FieldSelector to the IndexReader when you retrieve a Document, allowing you to define which fields to ignore, load, lazy load, etc.
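
To illustrate the idea of selecting fields at retrieval time, here is a minimal, self-contained sketch in Java. It does not use Lucene: the enum, the STORED map, and the load method are hypothetical stand-ins that only model the three choices mentioned above (ignore, load, lazy load). In Lucene itself the selector would be a FieldSelector passed to the IndexReader when the Document is fetched; check the Javadoc of your Lucene version for the exact signatures.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.function.Function;
import java.util.function.Supplier;

// Model of field selection at retrieval time. These are NOT Lucene classes;
// every name here is an assumption made for illustration.
class FieldSelectionSketch {

    // The three choices named in the email: load, lazy load, ignore.
    enum Choice { LOAD, LAZY_LOAD, NO_LOAD }

    // Counts how many stored values were actually read.
    static int reads = 0;

    // Hypothetical stored fields of a single document.
    static final Map<String, String> STORED = new LinkedHashMap<>();
    static {
        STORED.put("title", "Some page title");
        STORED.put("url", "http://example.org/page.html");
        STORED.put("content", "<html>...a very large page body...</html>");
    }

    // Simulates the cost of reading one stored value.
    static String readStored(String name) {
        reads++;
        return STORED.get(name);
    }

    // Builds a view of the document, asking the selector once per field.
    static Map<String, Supplier<String>> load(Function<String, Choice> selector) {
        Map<String, Supplier<String>> doc = new LinkedHashMap<>();
        for (String name : STORED.keySet()) {
            switch (selector.apply(name)) {
                case LOAD: {
                    String value = readStored(name); // read right away
                    doc.put(name, () -> value);
                    break;
                }
                case LAZY_LOAD:
                    // read only when (and each time) the value is requested
                    doc.put(name, () -> readStored(name));
                    break;
                case NO_LOAD:
                    break; // field is left out of the result entirely
            }
        }
        return doc;
    }

    public static void main(String[] args) {
        Map<String, Supplier<String>> doc = load(name ->
                name.equals("title") ? Choice.LOAD
              : name.equals("content") ? Choice.LAZY_LOAD
              : Choice.NO_LOAD);
        System.out.println("fields in result: " + doc.keySet());
        System.out.println("values read so far: " + reads);
    }
}
```

The point of the sketch is that a large stored field such as the HTML body costs nothing at retrieval time unless the selector asks for it, which matches the observation that storing the content affects index size and loading cost rather than search speed.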


I hope this helps.

--
karl