[
https://issues.apache.org/jira/browse/SOLR-17958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18029945#comment-18029945
]
David Smiley commented on SOLR-17958:
-------------------------------------
+1
> Deprecate TikaLanguageIdentifierUpdateProcessor in v9.10
> --------------------------------------------------------
>
> Key: SOLR-17958
> URL: https://issues.apache.org/jira/browse/SOLR-17958
> Project: Solr
> Issue Type: Improvement
> Components: contrib - LangId
> Reporter: Jan Høydahl
> Assignee: Jan Høydahl
> Priority: Major
>
> In the 'langid' module, we have three implementations of language detectors.
> The oldest one is TikaLanguageIdentifierUpdateProcessor, but we also have
> LangDetectLanguageIdentifierUpdateProcessor and
> OpenNLPLangDetectUpdateProcessor.
> This JIRA will deprecate TikaLanguageIdentifierUpdateProcessor.
> Reasons are:
> - The others are proably better
> - We want to remove Tika as a direct Solr dependency
> - The tika identifier is based on a Tika 1.x API that has been removed (they
> are now "Detectors" instead)
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]