Thank you, ipowers, for your reply. Perhaps I did not make myself clear
enough. As I explained in my original posting, I want to store large
documents in the Lucene index. Storing them elsewhere is not an option;
otherwise I would have done so already. My real question is question number
one, which did not receive a reply: is there a formula that can tell me
whether what is happening is reasonable and to be expected, or am I doing
something wrong, or something that could be done better? I realise I can
break down a large file into smaller chunks, but I don't want to do that
unless the answer to question one is that there is no alternative.
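
For reference, the chunking fallback mentioned above could look roughly like the sketch below. It uses only the Java standard library and deliberately leaves Lucene out; the class name ChunkSketch and the 1MB chunk size are illustrative assumptions, not anything prescribed by Lucene. Each resulting piece would presumably be stored as its own field or document, and join() shows that the original bytes can be recovered.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of Lucene: splits a large source
// document into bounded pieces and reassembles them again.
final class ChunkSketch {

    /** Splits data into consecutive pieces of at most chunkSize bytes. */
    static List<byte[]> split(byte[] data, int chunkSize) {
        if (chunkSize <= 0) {
            throw new IllegalArgumentException("chunkSize must be positive");
        }
        List<byte[]> chunks = new ArrayList<>();
        for (int start = 0; start < data.length; start += chunkSize) {
            int end = Math.min(data.length, start + chunkSize);
            chunks.add(Arrays.copyOfRange(data, start, end));
        }
        return chunks;
    }

    /** Concatenates the pieces back into one array, in list order. */
    static byte[] join(List<byte[]> chunks) {
        int total = 0;
        for (byte[] chunk : chunks) {
            total += chunk.length;
        }
        byte[] out = new byte[total];
        int pos = 0;
        for (byte[] chunk : chunks) {
            System.arraycopy(chunk, 0, out, pos, chunk.length);
            pos += chunk.length;
        }
        return out;
    }

    public static void main(String[] args) {
        // Stand-in for a source document slightly larger than 10MB.
        byte[] source = new byte[10 * 1024 * 1024 + 123];
        List<byte[]> chunks = split(source, 1024 * 1024); // assumed 1MB pieces
        System.out.println("pieces: " + chunks.size());
        System.out.println("round trip ok: " + Arrays.equals(source, join(chunks)));
    }
}
```

The order of the pieces has to be recorded somewhere (for example in a numbered field name) so that the document can be reassembled on retrieval.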
Cheers
From: "George Washington" <[EMAIL PROTECTED]>
Reply-To: java-user@lucene.apache.org
To: java-user@lucene.apache.org
Subject: Storing large text or binary source documents in the index and
memory usage
Date: Fri, 20 Jan 2006 04:52:20 +0000
I would like to store large source documents (>10MB) in the index in their
original form, i.e. as text for text documents or as byte[] for binary
documents.
I have no difficulty adding the source document as a field to the Lucene
index document, but when I write the index document to the index I
consistently get out-of-memory errors for documents larger than approx 9MB.
Is there a formula that can help calculate the max size of a document which
can be added to the index?
Is there an alternative way to store such large documents that you can
suggest?
I have 512MB memory under WinXP. Increasing the VM heap size does not help.
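
Since raising the heap size appears to make no difference, one thing worth ruling out is that the larger setting never reaches the JVM that runs the indexing code. The sketch below prints the maximum heap the running JVM reports and does a rough lower-bound estimate for holding a text in memory. It is not a formula for Lucene's actual peak usage, which is exactly the open question here. The class name HeapCheck and the number of simultaneous copies are assumptions chosen for illustration; the 2 bytes per character figure reflects UTF-16 character data and may be lower on newer JVMs that store Latin-1 strings compactly.

```java
// Hypothetical diagnostic, not part of Lucene.
final class HeapCheck {

    /** Maximum heap the running JVM will attempt to use, in bytes. */
    static long maxHeapBytes() {
        return Runtime.getRuntime().maxMemory();
    }

    /** Bytes needed for charCount characters held as UTF-16 data. */
    static long utf16Bytes(long charCount) {
        return 2L * charCount;
    }

    /** True if 'copies' simultaneous in-memory copies fit within maxHeap. */
    static boolean fits(long charCount, int copies, long maxHeap) {
        return utf16Bytes(charCount) * copies <= maxHeap;
    }

    public static void main(String[] args) {
        long maxHeap = maxHeapBytes();
        long chars = 10L * 1024 * 1024; // a text of about 10M characters
        int copies = 4;                 // assumed number of copies alive at once

        System.out.println("max heap (MB): " + maxHeap / (1024 * 1024));
        System.out.println("one copy (MB): " + utf16Bytes(chars) / (1024 * 1024));
        System.out.println(copies + " copies fit: " + fits(chars, copies, maxHeap));
    }
}
```

Running this with the same options as the indexing program (for example with -Xmx set to the intended value) shows whether the heap limit in effect is the one that was requested.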
Many thanks
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]