[ https://issues.apache.org/jira/browse/SOLR-17958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18030491#comment-18030491 ]
ASF subversion and git services commented on SOLR-17958:
--------------------------------------------------------
Commit 0a951f2c3f9815c33068a6fc2b44e3cc00739a21 in solr's branch
refs/heads/branch_9x from Jan Høydahl
[ https://gitbox.apache.org/repos/asf?p=solr.git;h=0a951f2c3f9 ]
SOLR-17958 Deprecate TikaLanguageIdentifierUpdateProcessor (#3776) (#3783)
(cherry picked from commit 6b33f387423ac00350828339c4ba9bf2d07ced73)
> Deprecate TikaLanguageIdentifierUpdateProcessor in v9.10
> --------------------------------------------------------
>
> Key: SOLR-17958
> URL: https://issues.apache.org/jira/browse/SOLR-17958
> Project: Solr
> Issue Type: Improvement
> Components: contrib - LangId
> Reporter: Jan Høydahl
> Assignee: Jan Høydahl
> Priority: Major
> Labels: pull-request-available
> Time Spent: 40m
> Remaining Estimate: 0h
>
> In the 'langid' module, we have three implementations of language detectors.
> The oldest one is TikaLanguageIdentifierUpdateProcessor, but we also have
> LangDetectLanguageIdentifierUpdateProcessor and
> OpenNLPLangDetectUpdateProcessor.
> This JIRA will deprecate TikaLanguageIdentifierUpdateProcessor.
> Reasons are:
> - The others are probably better
> - We want to remove Tika as a direct Solr dependency
> - The Tika identifier is based on a Tika 1.x API that has been removed (Tika
> now provides "Detectors" instead)
--
This message was sent by Atlassian Jira
(v8.20.10#820010)