Thank you ipowers for your reply. Perhaps I did not make myself clear enough. As I explained in my original posting I want to store large documents in the Lucene index. Storing them elsewhere is not an option, otherwise I would have done so already. My real question is question number one, which did not receive a reply, is there a formula that can tell me if what is happening is reasonable and to be expected, or am I doing something wrong or something which can be done better. I realise I can breakdown a large file into smaller chunks, but I don't want to do that unless the answer to question one is that there is no other alternative.
Cheers




From: "George Washington" <[EMAIL PROTECTED]>
Reply-To: java-user@lucene.apache.org
To: java-user@lucene.apache.org
Subject: Storing large text or binary source documents in the index and memory usage
Date: Fri, 20 Jan 2006 04:52:20 +0000

I would like to store large source documents (>10MB) in the index in their original form, i.e. as text for text documents or as byte[] for binary documents. I have no difficulty adding the source document as a field to the Lucene index document, but when I write the index document to the index I consistently get out-of-memory errors for documents larger than approx 9MB. Is there a formula that can help calculate the max size of a document which can be added to the index? Is there an alternative way to store such large documents that you can suggest?
I have 512MB memory under WinXP. Increasing the VM heap size does not help.
Many thanks

_________________________________________________________________
Make your dream car a reality http://a.ninemsn.com.au/b.aspx?URL=http%3A%2F%2Fcarpoint%2Eninemsn%2Ecom%2Eau&_t=12345&_r=emailtagline&_m=EXT


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


_________________________________________________________________
Make your dream car a reality http://a.ninemsn.com.au/b.aspx?URL=http%3A%2F%2Fcarpoint%2Eninemsn%2Ecom%2Eau&_t=12345&_r=emailtagline&_m=EXT


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to