I noticed that there is a languages file in /usr/local/share/spamassassin/ (DEF_RULES_DIR) installed by the SA package as well as a newer and larger version installed by sa-update.
When I look at the debug it looks like the wrong one is being used: textcat: loading languages file /usr/local/share/spamassassin/languages