posting list strings

2013-07-05 Thread Sriram Sankar
It looks like Lucene stores the string names of the posting lists in the index. How compact is this storage (when there may be a very large number of posting lists, and the string lengths may be large - for example, I may have an entry that looks like "Cn:4593846->8957363,485". I've seen other sy

Re: Accumulating facets over a MultiReader

2013-07-05 Thread Shai Erera
no option 1 is better than reindexing, but option 2 is the fastest imo. Shai On Fri, Jul 5, 2013 at 6:55 PM, Peng Gao wrote: > Thanks. > > Yes, that's the case. I'll try it out. > > Is Option 1 more expensive than re-indexing? > > > > -Original Message- > > From: Shai Erera [mailto:ser

RE: Accumulating facets over a MultiReader

2013-07-05 Thread Peng Gao
Thanks. Yes, that's the case. I'll try it out. Is Option 1 more expensive than re-indexing? > -Original Message- > From: Shai Erera [mailto:ser...@gmail.com] > Sent: Friday, July 05, 2013 8:25 AM > To: java-user@lucene.apache.org > Subject: Re: Accumulating facets over a MultiReader >

Re: Accumulating facets over a MultiReader

2013-07-05 Thread Shai Erera
Yes, there are two ways to do that. First, I assume that what you want to do is a.addIndexes(b), and if a document D in b is in a, you don't want to add it to a, right? In that case, two options: Option 1 Iterate on the documents in b (their primary key) and if a doc is found in a, delete it from

RE: Accumulating facets over a MultiReader

2013-07-05 Thread Peng Gao
Shai, Once again, thanks for the help. Yes, I am re-indexing. Using FacetFields.addFacets() on the doc works. Given that I need to check the uniqueness before merging an index with facets into a master, is there better way to it without re-indexing? Gao Peng > -Original Message- > From:

Re: Regarding Lucene Highlighting feature.

2013-07-05 Thread Roberto Ragusa
On 07/05/2013 01:27 PM, VIGNESH S wrote: > Hi, > > I think using CompressingStoredFieldsFormat Feature introduced in Lucene > 4.1 may help reduce index size. > > Any other comments and suggestions are welcome in this topic.. > Do you have access to the original documents, outside Lucene? If so,

Re: Regarding Lucene Highlighting feature.

2013-07-05 Thread VIGNESH S
Hi, I think using CompressingStoredFieldsFormat Feature introduced in Lucene 4.1 may help reduce index size. Any other comments and suggestions are welcome in this topic.. Thanks and Regards Vignesh Srinivasan 9739135640 On Thu, Jul 4, 2013 at 6:38 PM, VIGNESH S wrote: > Hi, > > Is it mandat