[ 
https://issues.apache.org/jira/browse/TIKA-3340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315541#comment-17315541
 ] 

Tim Allison commented on TIKA-3340:
-----------------------------------

Nothing surprising in the results.  

Optimaize is the off the shelf optimaize with no new models, so it only covers 
half the languages and is scored only on the langs it knows.

OpenNLPLangDetector is the off the shelf OpenNLPLanguageDetector with the 
langdetect-183.bin models

OpenNLPTikaEvalDetector is tika-eval's integration with opennlp using the 
current model_20190626.bin model.

TikaDetector is the current homegrown langid model with new models built 
freshly for all langs.

TikaOpenNLPDetector is the current Tika opennlp integration with the proposed 
model.


> LanguageProfile for Myanmar
> ---------------------------
>
>                 Key: TIKA-3340
>                 URL: https://issues.apache.org/jira/browse/TIKA-3340
>             Project: Tika
>          Issue Type: Improvement
>          Components: languageidentifier
>            Reporter: Arky
>            Priority: Major
>         Attachments: table-summarized-truncated.txt.gz
>
>
> A language profile for detecting Myanmar/Burmese (my).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to