Re: Memory Usage

Daniel Noll Tue, 15 Nov 2005 21:30:39 -0800

Marvin Humphrey wrote:

The formatting of the results turned up a little screwy in my emailreader, so here's a reformatted version...

I noticed the same thing on Thunderbird, although viewing the sourceshowed that the original was okay, and KMail didn't seem to have thesame issue. However, the quoting at the front of the table does appearto fix the formatting. :-)

I'm only passingly familiar with the org.apache.lucene.searchpackage, so I'm not sure what could account for this; I wouldnormally expect a more common term to take longer, as there are moredocs to score. Anybody got a expanation handy?

I was figuring that a term which exists in more documents would bequicker to populate the initial hits for, but a term which has less thanthe number of initial hits would take longer. But you're right, thatdoesn't sound like the behaviour of an index at all, it should be linearuntil scoring enters into it.

I'm not sure how scoring affects all of this at the moment, though... weactually performed the norms-removal hack on our copy of Lucene as well(that reduced memory usage even more) before doing all of this testingand I'm not sure whether that would affect the scoring also (it doesn'thave to read the norms in during the search, which must have madesearching faster overall as a side-effect.)


Daniel


--
Daniel Noll

NUIX Pty Ltd
Level 8, 143 York Street, Sydney 2000
Phone: (02) 9283 9010
Fax:   (02) 9283 9020

This message is intended only for the named recipient. If you are not
the intended recipient you are notified that disclosing, copying,
distributing or taking any action in reliance on the contents of this
message or attachment is strictly prohibited.


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Memory Usage

Reply via email to