Gwen Shapira created KAFKA-3250:
-----------------------------------

             Summary: release tarball is unnecessarily large due to duplicate 
libraries
                 Key: KAFKA-3250
                 URL: https://issues.apache.org/jira/browse/KAFKA-3250
             Project: Kafka
          Issue Type: Bug
            Reporter: Gwen Shapira


Between 0.8.2.2 and 0.9.0, our release tarballs grew from 17M to 34M. We 
thought it is just due to new libraries and dependencies. But:
1. If you untar Kafka into a directory and check the directory size (du -sh), 
it is around 28M, smaller than the tarball. Recompressing give you 25M tarball.
2. If you list the original tar contents and grep for "snappy", you see it 4 
times in the tarball.

Clearly we are creating a tarball with duplicates (and we didn't before).
I think its due to how we are generating the tarball from core but pull in 
other projects into libs/ directory with their dependencies (which overlap).

We need to find out how to sort it out (possibly with excludes).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to