Chris Lu writes:
> 1. Need an efficient way to pick up the most frequent words in an index.
>
> I think this can be done, any example will be appreciated.

I don't see an alternative to looping through all terms and looking at their frequency.
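A minimal sketch of that loop, in plain Java so it runs without Lucene on the classpath. It assumes the (term, document frequency) pairs have already been collected into a map, for example by walking the term enumeration returned by IndexReader.terms() and reading docFreq() for each term. The TopTerms class and its mostFrequent method are hypothetical names for illustration, not part of Lucene. Every term is still visited once, but the heap itself never holds more than n entries.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.Comparator;
import java.util.List;
import java.util.Map;
import java.util.PriorityQueue;

// Hypothetical helper, not Lucene API: picks the n most frequent terms
// from a map of term -> document frequency.
class TopTerms {

    /** Returns up to n terms with the highest frequency, most frequent first. */
    static List<String> mostFrequent(Map<String, Integer> docFreqs, int n) {
        List<String> result = new ArrayList<>();
        if (n <= 0) {
            return result;
        }

        // Min-heap ordered by frequency: the head is the weakest candidate kept so far.
        Comparator<Map.Entry<String, Integer>> byFreq =
                (a, b) -> Integer.compare(a.getValue(), b.getValue());
        PriorityQueue<Map.Entry<String, Integer>> heap = new PriorityQueue<>(n, byFreq);

        // Single pass over all terms, as described above.
        for (Map.Entry<String, Integer> e : docFreqs.entrySet()) {
            if (heap.size() < n) {
                heap.add(e);
            } else if (e.getValue() > heap.peek().getValue()) {
                heap.poll(); // drop the weakest candidate
                heap.add(e);
            }
        }

        // The heap yields ascending frequency; reverse to get most frequent first.
        while (!heap.isEmpty()) {
            result.add(heap.poll().getKey());
        }
        Collections.reverse(result);
        return result;
    }
}
```

Calling TopTerms.mostFrequent(docFreqs, 100) would return the 100 terms that occur in the most documents. With a real index you could feed each term into the heap directly while enumerating, instead of building the map first.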
> 2. search by the most frequent words, with sort by options
>
> Is this the only way to warm up lucene? For large indexes, the first
> sort-by search is slow.

That's independent of what you search for. Sorting is done by creating an in-memory array of all field values of all documents. This array is cached and reused for further searches. So if you use sorting, doing one sorted search after creating the index might be useful.

For reading relevant parts of the index into OS caches, I'd rather use the most commonly searched terms than the most frequent ones.

HTH
Morus