Re: Size of lucene norm file

2011-09-18 Thread Erick Erickson
Here's a useful link as well: http://lucene.apache.org/java/3_0_2/fileformats.html#file-names Erick On Sun, Sep 18, 2011 at 1:17 AM, roz dev wrote: > Norms (*.nrm) > > Norms are an index time normalization factor that can be factored into > scoring. Document and field boosts as well as length n

Re: Size of lucene norm file

2011-09-18 Thread Li Li
docNum * IndexedFieldsNum * 1 Bytes you should disable indexed fields which are not used for relevancy rank. On Sun, Sep 18, 2011 at 5:20 AM, roz dev wrote: > Hi, > > I want to estimate the size of NORM file that lucene will generate for a 20 > Gb index which has 2.5 Million Docs and 50 fields

RE: Size of lucene norm file

2011-09-18 Thread Uwe Schindler
http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: roz dev [mailto:rozde...@gmail.com] > Sent: Sunday, September 18, 2011 10:18 AM > To: java-user@lucene.apache.org > Subject: Re: Size of lucene norm file > > Norms (*.nrm) > > Norms

Re: Size of lucene norm file

2011-09-18 Thread roz dev
Norms (*.nrm) Norms are an index time normalization factor that can be factored into scoring. Document and field boosts as well as length normalization are applied with norms. When in memory, norms occupy one byte per document for each field with norms on, even if only one document has norms on fo

Re: Size of lucene norm file

2011-09-18 Thread janwen
What is NORM file? On 2011-9-18 5:20, roz dev wrote: Hi, I want to estimate the size of NORM file that lucene will generate for a 20 Gb index which has 2.5 Million Docs and 50 fields in each document. Is there any formula to predict it? And, what is the RAM cost of this nrm file. Thanks S