Re: Help with huge index

2018-02-28 Thread Stuart Goldberg
Thanks so much. I actually found that my purging routine finished after about 35 minutes which is really acceptable given that this routine is supposed to run during the overnight period. On Feb 28, 2018 8:34 PM, "Adrien Grand" wrote: > Thanks. Deleting lots of documents can indeed trigger a lot

Re: Help with huge index

2018-02-28 Thread Adrien Grand
Thanks. Deleting lots of documents can indeed trigger a lot of work in the Lucene side. First Lucene likely needs to rewrite the live docs of all your segments and then this might trigger significant merging activity due to the fact that Lucene tries to keep the number of deleted docs reasonable so

Re: Help with huge index

2018-02-28 Thread Stuart Goldberg
I call deleteDocuments On Feb 28, 2018 8:16 PM, "Adrien Grand" wrote: > What do you mean by purging? What methods do you call? > > Le mer. 28 févr. 2018 à 19:34, Stuart Goldberg a > écrit : > > > I have huge lucene index. On disk it's about 24Gb. > > > > > > > > I have a purging routine that is

Re: Help with huge index

2018-02-28 Thread Adrien Grand
What do you mean by purging? What methods do you call? Le mer. 28 févr. 2018 à 19:34, Stuart Goldberg a écrit : > I have huge lucene index. On disk it's about 24Gb. > > > > I have a purging routine that is supposed to run and purge old docs. > > > > There are about 650 million docs in there and

Re: Count terms for IntPoint field

2018-02-28 Thread Adrien Grand
You probably want to look at PointValues.size(), which gives you the number of indexed points. Doc values do not support index statistics however. Le mer. 28 févr. 2018 à 21:47, Riccardo Tasso a écrit : > Hello, > I'm porting an application from lucene 4 to lucene 7. > > I've converted a field

Re: Recommendation for doing a search plus collecting extra information?

2018-02-28 Thread Trejkaz
On Mon, Oct 12, 2015 at 3:28 PM, Uwe Schindler wrote: > Hi, > > it may sound a bit stupid, but you can do the following: > > If you search for a docvalues (previously fieldcache) field in lucene, the > returned TopFieldDocs contains also the field values > that were sorted against. The ScoreDoc i

Count terms for IntPoint field

2018-02-28 Thread Riccardo Tasso
Hello, I'm porting an application from lucene 4 to lucene 7. I've converted a field from IntField to IntPoint and at query or indexing time everything is ok. When I call the method: reader.getSumTotalTermFreq(field); it returns zero for my IntPoint field. I understand that IntPoint is stored i

Help with huge index

2018-02-28 Thread Stuart Goldberg
I have huge lucene index. On disk it's about 24Gb. I have a purging routine that is supposed to run and purge old docs. There are about 650 million docs in there and through testing I have determined that about 1/3 of these need to be purged. During the purge, every so often it's appare

Re: Help required in Ways to compress index size.

2018-02-28 Thread benafia salem
Hi prtahp, I think the first step you need to check if you want to reduce your index size, is avoid "storing" fields. Do a first test and check if your search performance still met your expections. 2018-02-28 12:46 GMT+01:00 prathap simha : > Hi Lucene Team, > > Greetings of the day. Thanks for

Help required in Ways to compress index size.

2018-02-28 Thread prathap simha
Hi Lucene Team, Greetings of the day. Thanks for your great search engine. We are using the .Net version of Lucene for our product search application component. Because of customers huge project sizes our search index size is very huge day by day. So, we want to reduce the index size by using t