Quickest way to collect one field from the searched docs....

2014-09-18 Thread Shouvik Bardhan
Pardon the length of the question. I have an index with 100 million docs (lucene not solr) and term queries (A*, A AND B* type queries) return pretty quickly (2 -4 secs) and I pick the lucene docIds up pretty quickly with a collector. This is good for us since we take the docIds and do further filt

Re: Quickest way to collect one field from the searched docs....

2014-09-19 Thread Shouvik Bardhan
n minimize the network > traffic by setting field list to only this record_id. > > -sujit > > > On Thu, Sep 18, 2014 at 9:23 PM, Shouvik Bardhan > wrote: > > > Pardon the length of the question. I have an index with 100 million docs > > (lucene not solr) and term que

Re: Quickest way to collect one field from the searched docs....

2014-09-20 Thread Shouvik Bardhan
I will take a look at DocField. Thanks for the suggestion. On Fri, Sep 19, 2014 at 6:30 PM, Neil Bacon wrote: > Hi > Have you looked at DocFieldValue / DocField? It's fast for this use case. > Regards > Neil > > Sent from my mobile doovalaki > > On 20/09/2014 6:

When does a segment file gets written...

2014-11-04 Thread Shouvik Bardhan
Apologies if this is trivial but I have read what I could on the net and could not come up with a good answer. I have stood up a lucene server (not solr) which listens on 2 ports - one for indexing and one for search. Also on different threads. The indexer threads use one IndexWriter and the searc

Re: When does a segment file gets written...

2014-11-04 Thread Shouvik Bardhan
< luc...@mikemccandless.com> wrote: > Call commit. > > Mike McCandless > > http://blog.mikemccandless.com > > > On Tue, Nov 4, 2014 at 8:28 AM, Shouvik Bardhan > wrote: > > Apologies if this is trivial but I have read what I could on the net and > > co

Re: When does a segment file gets written...

2014-11-05 Thread Shouvik Bardhan
understand. thanks for the help, On Tue, Nov 4, 2014 at 12:01 PM, Michael McCandless < luc...@mikemccandless.com> wrote: > On Tue, Nov 4, 2014 at 11:44 AM, Shouvik Bardhan > wrote: > > > Thanks for the reply (and thanks for everything else too !!) Mike. > > You'

High frequency terms in results document....

2015-02-15 Thread Shouvik Bardhan
Apologies if I have missed it in discussions prior but I looked all over. I looked at the Luke code and it does find high frequency terms on the entire index. I am trying to get the top N high frequency terms in the documents returned from a search result. I came across something called FilterIndex

Re: High frequency terms in results document....

2015-02-19 Thread Shouvik Bardhan
e for it's help > Tomoko > > 2015-02-16 1:58 GMT+09:00 Shouvik Bardhan : > > > Apologies if I have missed it in discussions prior but I looked all > over. I > > looked at the Luke code and it does find high frequency terms on the > entire > > index. I am try

Index file deleting....

2017-11-21 Thread Shouvik Bardhan
Apologies if this has been discussed and thrashed out before. I found some discussion but still not clear about several things. Based on one of Mike's answers a while back, I have ran my test program with a lucene-core jar which was built with VERBOSE_REF_COUNTS = true. This is all on Lucene 6.6.2