Mark Miller wrote:
Lebiram wrote:
Also, what are norms?
Norms are a byte value per field stored in the index that is factored
into the score. They are used for length normalization (shorter documents
= more important) and for index-time boosting. If you want either of
those, you need norms. When norms are loaded up into an IndexReader,
they are loaded into a byte[maxDoc] array for each field - so even if
only one document out of 400 million has a field, it will still load
byte[maxDoc] for that field (a lot of wasted RAM). Did you say you had
400 million docs and 7 fields? Google says that would be:

400 million docs x 7 fields x 1 byte ~= 2,670.29 MB (roughly 2.7 GB)

On top of your other RAM usage.
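
For what it's worth, here is a minimal sketch of how you would turn
norms off per field, assuming the Lucene 2.x/3.x-era Field API (the
field names and values are made up for illustration):

    import org.apache.lucene.document.Document;
    import org.apache.lucene.document.Field;

    public class OmitNormsExample {
        public static Document makeDoc(String id, String body) {
            Document doc = new Document();
            // An identifier field rarely needs length normalization or
            // index-time boosts, so norms can be omitted to save RAM.
            doc.add(new Field("id", id,
                    Field.Store.YES, Field.Index.NOT_ANALYZED_NO_NORMS));
            // ANALYZED_NO_NORMS tokenizes the text but writes no norms,
            // so no byte[maxDoc] array gets allocated for this field
            // at search time.
            doc.add(new Field("body", body,
                    Field.Store.NO, Field.Index.ANALYZED_NO_NORMS));
            return doc;
        }
    }

If I remember right, norms are also "sticky": once even one document in
the index has norms for a field, merged segments end up carrying norms
for every document in that field, which is exactly the byte[maxDoc]
waste described above.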
Just to avoid confusion, that should really read a byte per document per
field. If I remember right, the single-byte encoding gives 255 possible
boost values, limited to about 25 once length normalization is folded in.
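
To see that precision loss concretely, here is a small sketch using the
static encodeNorm/decodeNorm helpers that Similarity exposed in the
Lucene 2.x/3.x era (later versions changed this API, so treat the method
names as an assumption about that era):

    import org.apache.lucene.search.Similarity;

    public class NormPrecision {
        public static void main(String[] args) {
            // The norm is a float squeezed into a single byte, so
            // nearby boost values collapse onto the same encoded byte
            // and decode back to the same coarse float.
            float[] boosts = {1.0f, 1.05f, 1.1f, 1.25f, 2.0f};
            for (float b : boosts) {
                byte encoded = Similarity.encodeNorm(b);
                float decoded = Similarity.decodeNorm(encoded);
                System.out.printf("boost %.2f -> byte %d -> %.4f%n",
                        b, encoded, decoded);
            }
        }
    }

Running something like this makes it easy to see which of your intended
boost values actually survive the round trip through the one-byte norm.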