[ 
https://issues.apache.org/jira/browse/TIKA-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17312402#comment-17312402
 ] 

Tim Allison commented on TIKA-3343:
-----------------------------------

The LanguageHandler is currently hardcoded to use Tika's own LanguageDetector 
in 1.x and 2.x.  We should figure out how to let users configure different 
language detectors (whether that's service loading or configuration via 
tika-config) in 2.x for the LanguageHandler and ...anything else?

> Remove Tika custom lang detection for 2.x
> -----------------------------------------
>
>                 Key: TIKA-3343
>                 URL: https://issues.apache.org/jira/browse/TIKA-3343
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Major
>
> In the back of my mind, this was an agreed upon change for 2.x. I can't find 
> documentation, tho, so I'm opening this issue to discuss.  
> My memory is that we agreed that we should outsource language id to other 
> tools and remove our own lang ider for 2.x.  If my memory is wrong, or if 
> there's a good reason to keep our language detection algorithm and data, 
> let's discuss.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to