Re: List of files that Lucene 4.0 generates during indexing

2013-01-30 Thread saisantoshi
The following files are originally created files (upon an initial indexing): _0.fdt _0.fdx _0.fnm _0.si _0_Lucene40_0.frq _0_Lucene40_0.prx _0_Lucene40_0.tim _0_Lucene40_0.tip _0_nrm.cfe _0_nrm.cfs index.v0008

Re: List of files that Lucene 4.0 generates during indexing

2013-01-24 Thread saisantoshi
Thanks. Could you please also comment on the following as well? http://lucene.472066.n3.nabble.com/TopDocCollector-vs-TopScoreDocCollector-semantics-changed-in-4-0-not-backward-comptabile-td4035806.html Thanks and really appreciate your help. Thanks, Sai. -- View this message in context: htt

Re: List of files that Lucene 4.0 generates during indexing

2013-01-24 Thread Michael McCandless
You get/set the merge policy on IndexWriterConfig (which you pass to IndexWriter). And then you can set this CFS ratio via that merge policy. Mike McCandless http://blog.mikemccandless.com On Thu, Jan 24, 2013 at 5:35 PM, saisantoshi wrote: > Thanks a lot. One last question, how do we set it?

Re: List of files that Lucene 4.0 generates during indexing

2013-01-24 Thread saisantoshi
Thanks a lot. One last question, how do we set it? IndexWriter.??? Thanks, Ranjith. -- View this message in context: http://lucene.472066.n3.nabble.com/List-of-files-that-Lucene-4-0-generates-during-indexing-tp4035993p4036091.html Sent from the Lucene - Java Users mailing list archive at Nabbl

Re: List of files that Lucene 4.0 generates during indexing

2013-01-24 Thread Michael McCandless
I would leave the default until/unless something goes wrong ... Mike McCandless http://blog.mikemccandless.com On Thu, Jan 24, 2013 at 5:28 PM, saisantoshi wrote: > Thanks. Are there any best practices to follow here? or leave the the > default > ( which is hybrid approach as you mentioned). >

Re: List of files that Lucene 4.0 generates during indexing

2013-01-24 Thread saisantoshi
Thanks. Are there any best practices to follow here? or leave the the default ( which is hybrid approach as you mentioned). -- View this message in context: http://lucene.472066.n3.nabble.com/List-of-files-that-Lucene-4-0-generates-during-indexing-tp4035993p4036086.html Sent from the Lucene - J

Re: List of files that Lucene 4.0 generates during indexing

2013-01-24 Thread Michael McCandless
4.0 has a hybrid approach by default: "big" segments (> 10% of index size, by default) are non-compound-files and small segments are compound files. See TieredMergePolicy.setNoCFSRatio if you want to always use compound file format. Mike McCandless http://blog.mikemccandless.com On Thu, Jan 24

Re: List of files that Lucene 4.0 generates during indexing

2013-01-24 Thread saisantoshi
Thanks Michael. The additional file in the list is just a typo. One more question is, we were using 2.4 before, and it only generated few files _0.cfs _0.cfx // segment files I am assuming that the 2.4 version has the compound index structure enabled by default. Do we need to set it explicitly w

Re: List of files that Lucene 4.0 generates during indexing

2013-01-24 Thread Michael McCandless
That looks correct, except I don't know what index.v0008 is. Mike McCandless http://blog.mikemccandless.com On Thu, Jan 24, 2013 at 1:22 PM, saisantoshi wrote: > Thanks. I checked it out. > > Here are the list of files that has been generated: > > _0.fdt > _0.fdx > _0.f

Re: List of files that Lucene 4.0 generates during indexing

2013-01-24 Thread saisantoshi
Thanks. I checked it out. Here are the list of files that has been generated: _0.fdt _0.fdx _0.fnm _0.si _0_Lucene40_0.frq _0_Lucene40_0.prx _0_Lucene40_0.tim _0_Lucene40_0.tip _0_nrm.cfe _0_nrm.cfs index.v000

Re: List of files that Lucene 4.0 generates during indexing

2013-01-24 Thread Steve Rowe
Hi saisantoshi, Check out the documentation: - particularly the "File Formats" link under "Reference Documents". Steve On Jan 24, 2013, at 11:41 AM, saisantoshi wrote: > Is there any doc on how many files that lucene generates during indexing >