Ian Lea wrote: Thank you for the quick and helpful reply. I had forgotten that Lucene's change document was one of best example of change documents around. I will read it.
On the specific question, calling doc() is still expensive. You could look at the FieldCache or the new DocValues stuff. See http://www.searchworkings.org/blog/-/blogs/introducing-lucene-index-doc-values for info on the latter.
I will explore that. I occurred to me that I do not know why the search performance degrades when doc() is called within the Collector. Is it simply that Lucene will present, for example, thousands of candidate hits (from millions of indexed documents) to the Collector even though the collector might only return the top handful? And so the Collector will need to load thousands of documents and it is this document loading that causes the performance degradation? Or is it more complex -- perhaps having to do with caches and other internals? -- Andrew --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org