[ https://issues.apache.org/jira/browse/TIKA-2492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tim Allison updated TIKA-2492: ------------------------------ Fix Version/s: (was: 2.0.0) 2.0.0-BETA > Remove pdfdebugger from tika > ---------------------------- > > Key: TIKA-2492 > URL: https://issues.apache.org/jira/browse/TIKA-2492 > Project: Tika > Issue Type: Improvement > Components: packaging > Reporter: Tilman Hausherr > Priority: Minor > Fix For: 1.17, 2.0.0-BETA > > > PDFDebugger isn't needed in tika but it is a dependency in pdfbox-tools > (because that one contains the command line interface, which calls the PDFBox > command line tools). > Thus I suggest that the tika parser pom be changed like this: > {code} > <dependency> > <groupId>org.apache.pdfbox</groupId> > <artifactId>pdfbox-tools</artifactId> > <version>${pdfbox.version}</version> > <exclusions> > <exclusion> > <groupId>commons-logging</groupId> > <artifactId>commons-logging</artifactId> > </exclusion> > + <exclusion> > + <groupId>org.apache.pdfbox</groupId> > + <artifactId>pdfbox-debugger</artifactId> > + </exclusion> > </exclusions> > {code} > This saves you 200KB in tika-app. That's not much, but every weight loss > counts :-) > It should also be possible to get it removed from tika-bundle, but I don't > know how to remove it properly. Just removing it from "Embed-Dependency" > isn't enough. -- This message was sent by Atlassian Jira (v8.3.4#803005)