> -----Original Message----- > I think you might need to uncomment > > loadplugin Mail::SpamAssassin::Plugin::TextCat > > in /etc/mail/spamassassin/v310.pre (or your equivalent). > > As an aside, I'm afraid I simply reject some of them out of hand if they > contain a particular MIME charset. Using an Exim ACL, I do: > > deny message = Sorry, no one speaks this language here. > condition = ${if eq{lc:$mime_charset}{gb2312}{1}{0}} > > That one's simplified Chinese, but I reject other languages as well, > including Russian. > > You may be able to do something similar with your MTA. And it's one less > message for SA to scan :) >
Does anyone know how to do this with Qmail, or is it best left to Spamassassin? I've written some custom rules in my local.cf file to tag the Russian spam that seems to be on the rise. I think they're getting around some of my regular expressions, however, and I'd really like a better way to do it. We do have clients of Asian origin, so it would be kind of nice to be able to tag spam for the character sets (koi8-r & windows-1255) that the Russian spam uses, leaving other charsets alone. Currently I'm just regexp'ing the subject and body of the message for those character sets and scoring accordingly. A better way would be to match phrases from the spam messages themselves, although this will cause more load on SA, it would be a "more world-friendly" method. This involves translating the spam messages into English, then matching up a spam-like phrase to tag, then finding that phrase in the actual email file on the server(no codepage conversions necessary) then grabbing the section of non-sensical UTF-8 characters to put in my local.cf file for scoring. Anyone have better rules/methods? Cheers, Mike