Pardon the length of the question. I have an index with 100 million docs
(Lucene, not Solr) and term queries (A*, A AND B* type queries) return
pretty quickly (2-4 secs), and I pick the Lucene docIds up pretty quickly
with a collector. This is good for us since we take the docIds and do
further filt
n minimize the network
> traffic by setting field list to only this record_id.
>
> -sujit
>
>
> On Thu, Sep 18, 2014 at 9:23 PM, Shouvik Bardhan
> wrote:
>
> > Pardon the length of the question. I have an index with 100 million docs
> > (lucene not solr) and term que
I will take a look at DocField. Thanks for the suggestion.
On Fri, Sep 19, 2014 at 6:30 PM, Neil Bacon wrote:
> Hi
> Have you looked at DocFieldValue / DocField? It's fast for this use case.
> Regards
> Neil
>
> On 20/09/2014 6:
Apologies if this is trivial but I have read what I could on the net and
could not come up with a good answer.
I have stood up a Lucene server (not Solr) which listens on 2 ports, one
for indexing and one for search, each on different threads. The indexer
threads use one IndexWriter and the searc
<
luc...@mikemccandless.com> wrote:
> Call commit.
>
> Mike McCandless
>
> http://blog.mikemccandless.com
>
>
> On Tue, Nov 4, 2014 at 8:28 AM, Shouvik Bardhan
> wrote:
> > Apologies if this is trivial but I have read what I could on the net and
> > co
understand.
thanks for the help,
On Tue, Nov 4, 2014 at 12:01 PM, Michael McCandless <
luc...@mikemccandless.com> wrote:
> On Tue, Nov 4, 2014 at 11:44 AM, Shouvik Bardhan
> wrote:
>
> > Thanks for the reply (and thanks for everything else too !!) Mike.
>
> You'
Apologies if I have missed it in discussions prior but I looked all over. I
looked at the Luke code and it does find high frequency terms on the entire
index. I am trying to get the top N high frequency terms in the documents
returned from a search result. I came across something called
FilterIndex
e for its help
> Tomoko
>
> 2015-02-16 1:58 GMT+09:00 Shouvik Bardhan :
>
> > Apologies if I have missed it in discussions prior but I looked all over. I
> > looked at the Luke code and it does find high frequency terms on the entire
> > index. I am try
Apologies if this has been discussed and thrashed out before. I found some
discussion but am still not clear about several things. Based on one of Mike's
answers a while back, I have run my test program with a lucene-core jar
which was built with VERBOSE_REF_COUNTS = true. This is all on Lucene 6.6.2