Ewan Mellor created TIKA-2584:
---------------------------------

             Summary: Tika should have a way to pass arbitrary Tesseract options
                 Key: TIKA-2584
                 URL: https://issues.apache.org/jira/browse/TIKA-2584
             Project: Tika
          Issue Type: Improvement
          Components: parser
    Affects Versions: 1.17
            Reporter: Ewan Mellor


Tesseract has a very large number of config options (use tesseract 
--print-parameters to see them).  There is no mechanism for TesseractOCRParser 
/ TesseractOCRConfig to pass these to Tesseract, and so they cannot be 
controlled by user code.

Tika should pass these through as opaque key-value pairs, so that user code can 
set them as necessary.

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to