Re: occurrence of two terms with the highest frequency

2015-02-13 Thread Ian Lea
Sorry, finger trouble. Should have been oal which is shorthand for org.apache.lucene, so org.apache.lucene.search.TotalHitCountCollector. http://lucene.apache.org/core/4_10_3/core/org/apache/lucene/search/TotalHitCountCollector.html -- Ian. On Fri, Feb 13, 2015 at 6:55 PM, Maisnam Ns wrote:

Re: occurrence of two terms with the highest frequency

2015-02-13 Thread Maisnam Ns
Thanks Ian for your help. But I didn't get aol search, what it is ? tried searching in google but couldn't find. Thanks On Fri, Feb 13, 2015 at 3:00 AM, Ian Lea wrote: > I think you can do it with 4 simple queries: > > 1) +flying +shooting > > 2) +flying +fighting > > etc. > > or BooleanQuery e

Re: occurrence of two terms with the highest frequency

2015-02-12 Thread Ian Lea
I think you can do it with 4 simple queries: 1) +flying +shooting 2) +flying +fighting etc. or BooleanQuery equivalents with MUST clauses. Use aol.search.TotalHitCountCollector and it should be blazingly fast, even if you have more that 100 docs. -- Ian. On Thu, Feb 12, 2015 at 5:42 PM, Ma

occurrence of two terms with the highest frequency

2015-02-12 Thread Maisnam Ns
Hi, Can someone help me with this use case. Use case: Say there are 4 key words 'Flying', 'Shooting', 'fighting' and 'looking' in100 documents to search for. Consider 'Flying' and 'Shooting' co- occurs (together) in 70 documents where as 'Flying and 'fighting' co- occurs in 14 documents 'Flyin