On 28 Oct 2003 16:21:58 +0200, I posted to spamassassin-talk:
 >  2. (For completeness, figure out the mapping between the original
 >     naming scheme and the one used in SpamAssassin. This is outlined
 >     in lm/README but it would be beneficial to get an actual mapping.
 >     I attach one I recreated by induction :-)

Oops, sorry.

The file contained a couple of minor typos because I had handled some
special cases by hand ... Here's a fixed version, now actually tested.

Missing from sa/lm:

  textcat/LM/drents.lm (0-byte-file)

Missing from textcat/LM:

  sa/lm/inactive/haw.lm
  sa/lm/ja.iso-2022-jp.ln
  sa/lm/tr.iso-8859-9.ln

The mappings are attached.

/* era */

LM/afrikaans.lm lm/af.lm
LM/albanian.lm  lm/sq.lm
LM/amharic-utf.lm       lm/am.utf-8.lm
LM/arabic-iso8859_6.lm  lm/ar.iso-8859-6.lm
LM/arabic-windows1256.lm        lm/ar.windows-1256.lm
LM/armenian.lm  lm/hy.lm
LM/basque.lm    lm/eu.lm
LM/belarus-windows1251.lm       lm/be.windows-1251.lm
LM/bosnian.lm   lm/bs.lm
LM/breton.lm    lm/inactive/br.lm
LM/bulgarian-iso8859_5.lm       lm/bg.iso-8859-5.lm
LM/catalan.lm   lm/ca.lm
LM/chinese-big5.lm      lm/zh.big5.lm
LM/chinese-gb2312.lm    lm/zh.gb2312.lm
LM/croatian-ascii.lm    lm/hr.us-ascii.lm
LM/czech-iso8859_2.lm   lm/cs.iso-8859-2.lm
LM/danish.lm    lm/da.lm
LM/dutch.lm     lm/nl.lm
LM/english.lm   lm/en.lm
LM/esperanto.lm lm/eo.lm
LM/estonian.lm  lm/et.lm
LM/finnish.lm   lm/fi.lm
LM/french.lm    lm/fr.lm
LM/frisian.lm   lm/fy.lm
LM/georgian.lm  lm/ka.lm
LM/german.lm    lm/de.lm
LM/greek-iso8859-7.lm   lm/el.iso-8859-7.lm
LM/hebrew-iso8859_8.lm  lm/he.iso-8859-8.lm
LM/hindi.lm     lm/hi.lm
LM/hungarian.lm lm/hu.lm
LM/icelandic.lm lm/is.lm
LM/indonesian.lm        lm/id.lm
LM/irish.lm     lm/ga.lm
LM/italian.lm   lm/it.lm
LM/japanese-euc_jp.lm   lm/ja.euc-jp.lm
LM/japanese-shift_jis.lm        lm/ja.shift-jis.lm
LM/korean.lm    lm/ko.lm
LM/latin.lm     lm/la.lm
LM/latvian.lm   lm/lv.lm
LM/lithuanian.lm        lm/lt.lm
LM/malay.lm     lm/ms.lm
LM/manx.lm      lm/inactive/gv.lm
LM/marathi.lm   lm/mr.lm
LM/middle_frisian.lm    lm/inactive/middle-frisian.lm
LM/mingo.lm     lm/inactive/mingo.lm
LM/nepali.lm    lm/ne.lm
LM/norwegian.lm lm/no.lm
LM/persian.lm   lm/fa.lm
LM/polish.lm    lm/pl.lm
LM/portuguese.lm        lm/pt.lm
LM/quechua.lm   lm/qu.lm
LM/romanian.lm  lm/ro.lm
LM/rumantsch.lm lm/rm.lm
LM/russian-iso8859_5.lm lm/ru.iso-8859-5.lm
LM/russian-koi8_r.lm    lm/ru.koi8-r.lm
LM/russian-windows1251.lm       lm/ru.windows-1251.lm
LM/sanskrit.lm  lm/sa.lm
LM/scots.lm     lm/sco.lm
LM/scots_gaelic.lm      lm/gd.lm
LM/serbian-ascii.lm     lm/sr.us-ascii.lm
LM/slovak-ascii.lm      lm/sk.us-ascii.lm
LM/slovak-windows1250.lm        lm/sk.windows-1250.lm
LM/slovenian-ascii.lm   lm/sl.us-ascii.lm
LM/slovenian-iso8859_2.lm       lm/sl.iso-8859-2.lm
LM/spanish.lm   lm/es.lm
LM/swahili.lm   lm/sw.lm
LM/swedish.lm   lm/sv.lm
LM/tagalog.lm   lm/tl.lm
LM/tamil.lm     lm/ta.lm
LM/thai.lm      lm/th.lm
LM/turkish.lm   lm/tr.unknown.lm
LM/ukrainian-koi8_r.lm  lm/uk.koi8-r.lm
LM/vietnamese.lm        lm/vi.lm
LM/welsh.lm     lm/cy.lm
LM/yiddish-utf.lm       lm/yi.utf-8.lm
-- 
The email address era     the contact information   Just for kicks, imagine
at iki dot fi is heavily  link on my home page at   what it's like to get
spam filtered.  If you    <http://www.iki.fi/era/>  500 pieces of spam for
want to reach me, see     instead.                  each wanted message.

Reply via email to