[ 
https://issues.apache.org/jira/browse/SOLR-17958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18029945#comment-18029945
 ] 

David Smiley commented on SOLR-17958:
-------------------------------------

+1

> Deprecate TikaLanguageIdentifierUpdateProcessor in v9.10
> --------------------------------------------------------
>
>                 Key: SOLR-17958
>                 URL: https://issues.apache.org/jira/browse/SOLR-17958
>             Project: Solr
>          Issue Type: Improvement
>          Components: contrib - LangId
>            Reporter: Jan Høydahl
>            Assignee: Jan Høydahl
>            Priority: Major
>
> In the 'langid' module, we have three implementations of language detectors.
> The oldest one is TikaLanguageIdentifierUpdateProcessor, but we also have 
> LangDetectLanguageIdentifierUpdateProcessor and 
> OpenNLPLangDetectUpdateProcessor.
> This JIRA will deprecate TikaLanguageIdentifierUpdateProcessor.
> Reasons are:
> - The others are proably better
> - We want to remove Tika as a direct Solr dependency
> - The tika identifier is based on a Tika 1.x API that has been removed (they 
> are now "Detectors" instead)



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to