[ https://issues.apache.org/jira/browse/TIKA-1557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14329523#comment-14329523 ]
Uwe Schindler edited comment on TIKA-1557 at 2/20/15 9:05 PM:
--------------------------------------------------------------
I would not make this a special option only for Tesseract. As said on
TIKA-1555, it would be better to have a general way to "blacklist" some parsers
through TikaConfig.
Currently you have to maintain the whole list of parsers (or parse META-INF
yourself) and pass the full list to TikaConfig / AutoDetectParser /
CompositeParser. I would like to have an option in the Tika config to "blacklist"
parsers. Ideally this should also work for subclasses, so one could disable all
ExternalParser subclasses by adding ExternalParser to the blacklist.
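
To make the subclass-aware blacklist idea concrete, here is a minimal sketch of the filtering step only. It does not use the Tika API: the Parser, ExternalParser, OcrParser and TextParser types below are hypothetical stand-ins declared inside the example, and withoutBlacklisted is an invented helper name. How such a blacklist would actually be wired into TikaConfig is left open.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class ParserBlacklistSketch {
    // Hypothetical stand-ins for illustration; these are NOT the real Tika classes.
    interface Parser {}
    static class ExternalParser implements Parser {}
    static class OcrParser extends ExternalParser {} // assumed subclass of ExternalParser
    static class TextParser implements Parser {}

    // Returns the parsers that are not instances of any blacklisted class.
    // Class.isInstance also matches subclasses, so blacklisting ExternalParser
    // drops every parser that extends it.
    static List<Parser> withoutBlacklisted(List<Parser> parsers,
                                           List<Class<? extends Parser>> blacklist) {
        List<Parser> kept = new ArrayList<>();
        for (Parser parser : parsers) {
            boolean blocked = false;
            for (Class<? extends Parser> banned : blacklist) {
                if (banned.isInstance(parser)) {
                    blocked = true;
                    break;
                }
            }
            if (!blocked) {
                kept.add(parser);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<Parser> all = Arrays.asList(
                new ExternalParser(), new OcrParser(), new TextParser());
        List<Class<? extends Parser>> blacklist = new ArrayList<>();
        blacklist.add(ExternalParser.class);

        List<Parser> kept = withoutBlacklisted(all, blacklist);
        System.out.println(kept.size()); // prints 1 (only the TextParser remains)
    }
}
```

With this approach the caller would no longer need to enumerate every parser to keep; it would only name the classes to exclude, and the remaining list could then be handed to a composite parser.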
was (Author: thetaphi):
I would not make this a special option only for Tesseract. As said on
TIKA-1555, it would be better to have a general way to "blacklist" some parsers
through TikaConfig.
Currently you have to maintain the whole list of parsers (or parse META-INF
yourself) and pass the full list to TikaConfig / AutoDetectParser /
CompositeParser. I would like to have an option in the Tika config to "blacklist"
parsers. Ideally this should also work for subclasses, so one could disable all
ForkParser subclasses by adding ForkParser to the blacklist.
> Create TesseractOCR Option to Never Run
> ---------------------------------------
>
> Key: TIKA-1557
> URL: https://issues.apache.org/jira/browse/TIKA-1557
> Project: Tika
> Issue Type: New Feature
> Components: parser
> Reporter: Tyler Palsulich
> Assignee: Tyler Palsulich
> Priority: Minor
> Fix For: 1.8
>
>
> As brought up in TIKA-1555, TesseractOCRParser should have an option to never
> be run. So, we can add an {{enabled}} option to the Config.