Re: How to extract highest TF-IDF terms from Lucene index?

2012-05-09 Thread Mike McCandless
There is a tool named HighFreqTerms in contrib/misc that does this... Mike Sent from my iPad On May 9, 2012, at 4:18 PM, Michael Berkovsky wrote: > Hi, > > Assuming that there is a large lucene collection, and I want to extract top > N terms with highest TF/IDF scores from some field. > The

Re: Concurrency and multiple merge threads

2012-02-19 Thread Mike McCandless
Sounds like a nice machine! It's frustrating that RAMFile even has any sync'd methods... Lucene is write once, so once a RAMFile is written we don't need any sync to read it. Maybe on creating a RAMInputStream we could make a new ReadOnlyRAMFile, holding the same buffers without sync. That sa
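
The read-only idea in the snippet above can be sketched in plain Java. This is only an illustration under stated assumptions, not Lucene's actual RAMFile or RAMInputStream code: the class names WritableRamFile and ReadOnlyRamFile, and the freeze() method, are hypothetical. The writable file keeps its synchronized methods, while the frozen view holds the same byte buffers in final fields, so readers need no locking once writing has finished.

```java
import java.util.ArrayList;
import java.util.List;

public class ReadOnlyRamFileSketch {

    /** Hypothetical writable file: synchronized because buffers are appended during writing. */
    static class WritableRamFile {
        private final List<byte[]> buffers = new ArrayList<>();
        private long length;

        synchronized void addBuffer(byte[] buffer, int used) {
            buffers.add(buffer);
            length += used;
        }

        synchronized long getLength() {
            return length;
        }

        /** Call once writing is finished; the view shares the buffers, it does not copy them. */
        synchronized ReadOnlyRamFile freeze() {
            return new ReadOnlyRamFile(buffers.toArray(new byte[0][]), length);
        }
    }

    /** Hypothetical read-only view: final fields, no synchronized methods. */
    static final class ReadOnlyRamFile {
        private final byte[][] buffers;
        private final long length;

        ReadOnlyRamFile(byte[][] buffers, long length) {
            this.buffers = buffers;
            this.length = length;
        }

        byte[] getBuffer(int index) {
            return buffers[index];
        }

        int numBuffers() {
            return buffers.length;
        }

        long getLength() {
            return length;
        }
    }

    public static void main(String[] args) {
        WritableRamFile file = new WritableRamFile();
        byte[] data = {1, 2, 3};
        file.addBuffer(data, data.length);

        ReadOnlyRamFile view = file.freeze();
        System.out.println("length = " + view.getLength());
        System.out.println("same buffer = " + (view.getBuffer(0) == data));
    }
}
```

The sketch relies on the write-once property mentioned in the message: it is only safe if nothing modifies the buffers after freeze() is called.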