[ 
https://issues.apache.org/jira/browse/TIKA-2584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16419684#comment-16419684
 ] 

Hudson commented on TIKA-2584:
------------------------------

FAILURE: Integrated in Jenkins build tika-branch-1x #14 (See [https://builds.apache.org/job/tika-branch-1x/14/])
Fix for TIKA-2584 contributed by ewanmellor. (tallison: [https://github.com/apache/tika/commit/b2ca3781f7a27e7c0ca627359d13a66c56940039])
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/ocr/TesseractOCRConfig.java
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/ocr/TesseractOCRParser.java


> Tika should have a way to pass arbitrary Tesseract options
> ----------------------------------------------------------
>
>                 Key: TIKA-2584
>                 URL: https://issues.apache.org/jira/browse/TIKA-2584
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.17
>            Reporter: Ewan Mellor
>            Priority: Minor
>             Fix For: 1.18, 2.0.0
>
>
> Tesseract has a very large number of config options (use tesseract 
> --print-parameters to see them).  There is no mechanism for 
> TesseractOCRParser / TesseractOCRConfig to pass these to Tesseract, and so 
> they cannot be controlled by user code.
> Tika should pass these through as opaque key-value pairs, so that user code 
> can set them as necessary.
>  
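
For illustration only, here is a minimal sketch of what "opaque key-value
pairs" could look like when turned into a Tesseract command line. It is not
the code from the commit above. TesseractArgs and build() are hypothetical
names, and the key check is an assumption made for this sketch. The only
thing taken from Tesseract itself is the "-c VAR=VALUE" command-line option;
preserve_interword_spaces is used as an example parameter.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of Tika: turns opaque key-value pairs
// into "-c key=value" arguments for the tesseract command line.
class TesseractArgs {

    static List<String> build(String input, String outputBase,
                              Map<String, String> options) {
        List<String> cmd = new ArrayList<>();
        cmd.add("tesseract");
        cmd.add(input);
        cmd.add(outputBase);
        for (Map.Entry<String, String> e : options.entrySet()) {
            String key = e.getKey();
            // Assumption for this sketch: keys are restricted to letters,
            // digits and underscores so a key cannot pose as another flag.
            if (!key.matches("[A-Za-z0-9_]+")) {
                throw new IllegalArgumentException("Bad option name: " + key);
            }
            // The value is passed through unchanged; Tesseract interprets it.
            cmd.add("-c");
            cmd.add(key + "=" + e.getValue());
        }
        return cmd;
    }

    public static void main(String[] args) {
        // LinkedHashMap keeps the options in insertion order.
        Map<String, String> options = new LinkedHashMap<>();
        options.put("preserve_interword_spaces", "1");
        System.out.println(String.join(" ", build("page.png", "page", options)));
    }
}
```

Passing each pair as a separate list element (rather than building one
shell string) means the resulting list can be handed to ProcessBuilder
without any quoting of the values.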



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
