https://bz.apache.org/bugzilla/show_bug.cgi?id=58963

--- Comment #4 from Javen O'Neal <one...@apache.org> ---
Looking through eu-triggers-oom.xlsx, I didn't see any XML bombs likely to
bloat memory. The main memory consumer here is shared strings table containing
5700 unique values. The rest of the XML files are mostly empty.

To build up a test case to make sure there isn't anything else in the Excel
file, a unit test could be: read in a dictionary of random Norwegian words
(including æ, ø, å--if that matters), generate 5700 "sentences" composed of 10
random words each, and look at memory consumption.

The SharedStringsTable [1] uses an Array<CTRst> and a Map<String, Int> to store
the strings. The Map makes string lookup faster at the cost of increasing the
memory requirements, but both structures should be able to handle 6700 entries
without OOM'ing.

What kinds of objects did VisualVM indicate were consuming the largest amount
of memory?

[1]
https://svn.apache.org/viewvc/poi/trunk/src/ooxml/java/org/apache/poi/xssf/model/SharedStringsTable.java?view=markup

-- 
You are receiving this mail because:
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org

Reply via email to