Hello folks,

My users speak Chinese. I have found that SpamAssassin's Bayes classifier does not seem to work well with Chinese charsets (UTF-8 or Big5). Many legitimate mails, in fact almost all of them, get a BAYES_99 score, just as the real spam does. It looks like mail in a non-English language such as Chinese very easily ends up with a high Bayes score. I have set ok_locales all, but it doesn't help with the false-positive problem.
Another question: what should I expect to see if I run sa-learn --dump? Am I supposed to see what SA has learned, such as key phrases and words from the spam mails? If so, should I be able to see some Chinese phrases in the output?

Cheers,
Joshua