Ewan Mellor created TIKA-2584:
---------------------------------

             Summary: Tika should have a way to pass arbitrary Tesseract options
                 Key: TIKA-2584
                 URL: https://issues.apache.org/jira/browse/TIKA-2584
             Project: Tika
          Issue Type: Improvement
          Components: parser
    Affects Versions: 1.17
            Reporter: Ewan Mellor

Tesseract has a very large number of config options (run tesseract --print-parameters to list them). TesseractOCRParser / TesseractOCRConfig currently have no mechanism for passing these to Tesseract, so user code cannot control them.

Tika should pass these options through as opaque key-value pairs, so that user code can set them as necessary.
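
As a rough illustration of the pass-through idea, here is a minimal Java sketch. It assumes the options would be forwarded using Tesseract's -c VAR=VALUE command-line flag. The class OpaqueTesseractOptions and its methods are hypothetical names invented for this example; they are not part of Tika's API, and the way Tika actually builds its Tesseract command line may differ.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical holder for pass-through options; NOT an existing Tika class.
class OpaqueTesseractOptions {
    // LinkedHashMap keeps insertion order, so the command line is predictable.
    private final Map<String, String> options = new LinkedHashMap<>();

    // Store a key-value pair without interpreting it. The only check is that
    // the key looks like a Tesseract parameter name (letters, digits, '_').
    OpaqueTesseractOptions set(String key, String value) {
        if (key == null || !key.matches("[A-Za-z0-9_]+")) {
            throw new IllegalArgumentException("Invalid option name: " + key);
        }
        if (value == null) {
            throw new IllegalArgumentException("Value must not be null for " + key);
        }
        options.put(key, value);
        return this;
    }

    // Build an argument list of the form:
    //   tesseract <input> <outputbase> -c key1=value1 -c key2=value2 ...
    // Each argument is a separate list element, so no shell quoting is needed.
    List<String> toCommandLine(String inputImage, String outputBase) {
        List<String> cmd = new ArrayList<>();
        cmd.add("tesseract");
        cmd.add(inputImage);
        cmd.add(outputBase);
        for (Map.Entry<String, String> e : options.entrySet()) {
            cmd.add("-c");
            cmd.add(e.getKey() + "=" + e.getValue());
        }
        return cmd;
    }
}
```

User code could then set, for example, preserve_interword_spaces to 1 and hand the resulting list to java.lang.ProcessBuilder. Keeping the pairs opaque means Tika would not need a dedicated setter for every Tesseract parameter, at the cost of leaving validation of the values to Tesseract itself.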