On 3/21/07, Peter Keegan <[EMAIL PROTECTED]> wrote:
On a similar topic, has anybody measured query performance as a function of index size? Well, I did and the results surprised me. I measured query throughput on 8 indexes that varied in size from 55,000 to 4.4 million documents. When plotted on a graph, there is a distinct hyperbolic curve (1/x). I expected to see more of a linear curve with a sharp drop-off at some point. Interesting
The size of the index should also grow sub-linearly. I think some of what you are seeing is due to index compression. As you add more and more documents, there are fewer new terms added to the index. If you searched for a term in most of the documents, you should see linear scaling with number of documents. But those are cases when you want to separate out that component and cache it :-) -Yonik --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]