Is this related to 
https://stackoverflow.com/questions/72310943/poi-single-vs-multithreaded-performance
 ?






On Friday 20 May 2022, 15:41:28 IST, Wabi Sabi <wabisabi2...@gmail.com> wrote: 





Hello,

I am trying to parallelize Excel processing and I am noticing a
bizarre behavior - single threaded processing is actually faster...

I am not doing anything fancy. I just open an XSSFWorkbook, fill out some
values, run formula calcs and read the output. If I run single threaded -
initial run takes a few seconds to complete (assume because JVM needs to
load POI + all the XML, schemas, etc.), but performance improves
and subsequent runs all take about 100-200 ms.

Same logic executed in a separate thread runs easily for 5 seconds in each
thread.... So turns out that single threaded processing of say 10 files is
at 4.5 seconds, but multithreaded takes 5-6 easily... No files are shared
among threads.

The hotspots are in POIXMLDocument.load. Thread behavior also looks
correct. File contention is out of the picture too - reading a different
file each time.

Any ideas as two why or pointers at the POI multithreading best practices
are greatly appreciated.  Thank you very much in advance!

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@poi.apache.org
For additional commands, e-mail: user-h...@poi.apache.org

Reply via email to