[ 
https://issues.apache.org/jira/browse/TIKA-3340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17320273#comment-17320273
 ] 

Tim Allison commented on TIKA-3340:
-----------------------------------

We're up to 148 languages, including simplified vs traditional chinese and 
several new romanized langs (e.g. ben-rom).

{noformat}
afr
amh
ara
asm
ast
aze
bak
ban
bel
ben
ben-rom
bih
bos
bre
bul
cat
ceb
ces
che
ckb
cmn
cym
dan
deu
div
ekk
ell
eng
epo
est
eus
fao
fas
fin
fra
fry
ful
gla
gle
glg
gom
gsw
gug
guj
hat
hau
heb
hin
hin-rom
hrv
hun
hye
ibo
ind
isl
ita
jav
jpn
kan
kat
kaz
khm
kin
kir
knn
kor
kur
lao
lat
lav
lim
lin
lit
ltz
lug
lvs
mal
mar
mhr
min
mkd
mlg
mlt
mon
mri
msa
mya
mya-zaw
nan
nds
nep
new
nld
nno
nob
nso
oci
ori
orm
pan
pes
pnb
pol
por
pus
quz
roh
ron
rus
san
sin
slk
slv
snd
som
spa
sqi
srd
srp
ssw
swa
swe
tam
tam-rom
tat
tel
tel-rom
tgk
tgl
tha
tsn
tuk
tur
uig
ukr
urd
urd-rom
uzb
vie
vol
war
wol
xho
yid
yor
zho-simp
zho-trad
zul
{noformat}

> LanguageProfile for Myanmar
> ---------------------------
>
>                 Key: TIKA-3340
>                 URL: https://issues.apache.org/jira/browse/TIKA-3340
>             Project: Tika
>          Issue Type: Improvement
>          Components: languageidentifier
>            Reporter: Arky
>            Priority: Major
>         Attachments: 20210401-model.report.txt, 20210413.report.txt, 
> table-summarized-truncated.txt.gz
>
>
> A language profile for detecting Myanmar/Burmese (my).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to