[ 
https://issues.apache.org/jira/browse/TIKA-3340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17311738#comment-17311738
 ] 

Tim Allison commented on TIKA-3340:
-----------------------------------

[~arky], thank you for this pull request! Out of curiosity, which corpus did 
you use for training?

In Tika 2.x, we're moving away from our built-in language detection to existing 
projects: opennlp, optimaize, mitll, or lingo24.

If I added this now, it would probably disappear before 2.0.0-BETA is released.
Are you ok if I rebuild our opennlp model to include Burmese?

> LanguageProfile for Myanmar
> ---------------------------
>
>                 Key: TIKA-3340
>                 URL: https://issues.apache.org/jira/browse/TIKA-3340
>             Project: Tika
>          Issue Type: Improvement
>          Components: languageidentifier
>            Reporter: Arky
>            Priority: Major
>
> A language profile for detecting Myanmar/Burmese (my).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
