Re: RFR 8080640: Reduce copying when reading JAR/ZIP entries

Staffan Friberg Thu, 21 May 2015 09:50:24 -0700


On 05/20/2015 10:57 AM, Xueming Shen wrote:

On 05/18/2015 06:44 PM, Staffan Friberg wrote:
Hi,
Wanted to get reviews and feedback on this performance improvement for reading from JAR/ZIP files during classloading by reducing unnecessary copying and reading the entry in one go instead of in small portions. This shows a significant improvement when reading a single entry and for a large application with 10k classes and 500+ JAR files it improved the startup time by 4%.
For more details on the background and performance results please see the RFE entry.
RFE - https://bugs.openjdk.java.net/browse/JDK-8080640
WEBREV - http://cr.openjdk.java.net/~sfriberg/JDK-8080640/webrev.0

Cheers,
Staffan
Hi Staffan,
If I did not miss something here, from your use scenario it appears to me the only thing you really need here to help boost your performance is

    byte[] ZipFile.getAllBytes(ZipEntry ze);
You are allocating a byte[] at the use side and wrapping it with a ByteBuffer if the size is small enough; otherwise, you are letting the ZipFile allocate a big enough one for you. It does not look like you can re-use that byte[] (it has to be wrapped by the ByteArrayInputStream and returned), so why do you need two different methods here? The logic would be much simpler if you just let the ZipFile allocate the needed buffer with the appropriate size, fill in the bytes and return, with an "OOME" if the entry size is bigger than 2g.
The only thing we use from the input ze is its name; get the size/csize from the jzentry. I don't think jzentry.csize/size can be "unknown", they are from the "cen" table.
If the real/final use of the bytes is to wrap them with a ByteArrayInputStream, why bother using ByteBuffer here? Wouldn't a direct byte[] with exactly the size of the entry serve better?
-Sherman
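
For illustration, the single-method shape suggested above might look roughly like the sketch below. It is not the code from the webrev: ZipEntryBytes and readAllBytes are hypothetical names, and the sketch goes through the public ZipFile.getInputStream API rather than the internal jzentry, so it does not avoid the intermediate copying the RFE is about. It only shows the calling contract: take the name from the input entry, take the size from the entry the ZipFile itself reports, allocate exactly that many bytes, and fail with an OutOfMemoryError above the array limit.

```java
import java.io.EOFException;
import java.io.IOException;
import java.io.InputStream;
import java.util.zip.ZipEntry;
import java.util.zip.ZipFile;

// Hypothetical helper, not part of java.util.zip.
final class ZipEntryBytes {
    private ZipEntryBytes() {}

    static byte[] readAllBytes(ZipFile zf, ZipEntry ze) throws IOException {
        // Only the name of the caller's entry is used; the size comes from
        // the entry the ZipFile returns, which is backed by the central directory.
        ZipEntry actual = zf.getEntry(ze.getName());
        if (actual == null) {
            throw new IOException("no such entry: " + ze.getName());
        }
        long size = actual.getSize();
        if (size < 0) {
            throw new IOException("unknown size for entry: " + ze.getName());
        }
        if (size > Integer.MAX_VALUE) {
            throw new OutOfMemoryError("entry too large for a byte[]: " + ze.getName());
        }
        byte[] bytes = new byte[(int) size];
        try (InputStream in = zf.getInputStream(actual)) {
            int off = 0;
            // read() may return fewer bytes than requested, so loop until full.
            while (off < bytes.length) {
                int n = in.read(bytes, off, bytes.length - off);
                if (n < 0) {
                    throw new EOFException("entry shorter than declared size: " + ze.getName());
                }
                off += n;
            }
        }
        return bytes;
    }
}
```

A caller that needs a stream can then wrap the result directly, for example new ByteArrayInputStream(ZipEntryBytes.readAllBytes(zf, ze)), with no ByteBuffer in between.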

Hi Sherman,

Thanks for the comments. I agree. I started out with ByteBuffer because I was hoping to be able to cache things where the buffer was being used, but since the buffer is passed along further I couldn't figure out a clean way to do it. I will rewrite it to simply return a buffer, and only wrap it in the Resource class getByteBuffer.

What are your thoughts on updating ZipFile.getInputStream to return a ByteArrayInputStream for small entries? Currently I do that work outside in two places, and moving it would potentially speed up others reading small entries as well.
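
As a rough sketch of that idea, written as a wrapper around the public API rather than as a change inside ZipFile itself: SmallEntryStreams, open and the 64 KiB SMALL_ENTRY_LIMIT are all made-up names and values for illustration, and the thread does not say where the cutoff for a "small" entry would be.

```java
import java.io.ByteArrayInputStream;
import java.io.EOFException;
import java.io.IOException;
import java.io.InputStream;
import java.util.zip.ZipEntry;
import java.util.zip.ZipFile;

// Hypothetical wrapper, not part of java.util.zip.
final class SmallEntryStreams {
    // Assumed cutoff for illustration only; the thread names no value.
    static final int SMALL_ENTRY_LIMIT = 64 * 1024;

    private SmallEntryStreams() {}

    static InputStream open(ZipFile zf, ZipEntry ze) throws IOException {
        InputStream in = zf.getInputStream(ze);
        if (in == null) {
            throw new IOException("no such entry: " + ze.getName());
        }
        long size = ze.getSize();
        if (size < 0 || size > SMALL_ENTRY_LIMIT) {
            return in; // large or unknown size: hand back the regular stream
        }
        // Small entry: read it fully once and serve it from memory.
        try (InputStream src = in) {
            byte[] bytes = new byte[(int) size];
            int off = 0;
            while (off < bytes.length) {
                int n = src.read(bytes, off, bytes.length - off);
                if (n < 0) {
                    throw new EOFException("entry shorter than declared size: " + ze.getName());
                }
                off += n;
            }
            return new ByteArrayInputStream(bytes);
        }
    }
}
```

With something like this in one place, the two call sites that currently do the wrapping themselves could both call the same method.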


Thanks,
Staffan
