Adrian Bird created TIKA-4736:
---------------------------------

             Summary: tika-app-4.0.0-alpha1 - image extraction fails when using 
--extract
                 Key: TIKA-4736
                 URL: https://issues.apache.org/jira/browse/TIKA-4736
             Project: Tika
          Issue Type: Bug
            Reporter: Adrian Bird


Using -z works:
{code:java}
%JAVA_HOME%\bin\java -jar %TIKA_JAR% -z --extract-dir=ImageFiles MyTestFile.pdf 
{code}
Using --extract fails:
{code:java}
%JAVA_HOME%\bin\java -jar %TIKA_JAR% --extract --extract-dir=ImageFiles 
MyTestFile.pdf 
Exception in thread "main" org.apache.tika.exception.TikaConfigException: Can't 
have more than 2 unknown args: [--extract, -o, ImageFiles, MyTestFile.pdf]{code}
If I change the order I get a different error:
{code:java}
%JAVA_HOME%\bin\java -jar %TIKA_JAR% --extract-dir=ImageFiles --extract 
MyTestFile.pdf 
Exception in thread "main" org.apache.tika.exception.TikaConfigException: Can 
only set inputDir and outputDir once. Extra args: [--extract, MyTestFile.pdf]

{code}
 

I also noticed that V4 produces an error if you do the extraction more than 
once, saying the image files already exist, where V3 created multiple files.

 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to