: One simple way of doing this is maybe to write a wrapper for TermQuery
: that only returns docs with a Term Frequency > X as far as I
: understand the question those terms don't have to be within a certain
: window right?
I don't think you could do it as a Query Wrapper -- it would have to be a
Scorer wrapper, correct?
That's the approach rmuir and i were discussing on friday, and i just
posted a patch of the "guts" that could use some review...
https://issues.apache.org/jira/browse/LUCENE-3395
..the end goal would be options in TermQuery that would cause it to
automaticly wrap it's Scorer in one of these, ala..
TermQuery q = new TermQuery(new Term("foo","bar"));
q.setMinFreq(4.0f);
q.setMaxFreq(1000.0f);
...and in solr, options for this could be added to the {!term} parser...
q={!term f=foo minTf=4.0 maxTf=1000.0}bar
(could maybe add syntax to the regular query parser, but i think our
strategic meta-character reserves are dangerously low)
-Hoss