Re: Getting Top n term for a given field for a given time period

2009-04-24 Thread Michael McCandless
Make a RangeFilter that visits only docs in your time period, then run a search w/ a custom HitCollector that looks at the source of each doc and tallies up the results? For performance, you'll probably need to load the source using FieldCache (FieldCache.DEFAULT.getStrings(...)). Or, use Solr's

Getting Top n term for a given field for a given time period

2009-04-21 Thread Preetham Kajekar
Hi, I have a lucene index which has 20 mil documents. Each document has a timestamp field and a source field. I am interested in finding the top n sources for a given hour (based on the timestamp). I know we can get the top n sources fields easily using the IndexReader API, but was wondering