I created some rules to match Polish text: http://www.chaosreigns.com/sa/polish.txt
The rules with only ASCII characters work; the ones with UTF-8 characters don't. According to hexedit, they're identical in my maildir and in my /etc/spamassassin/local.cf.

"SA can handle UTF-8 strings in rules at least since SA 3.2 on Perl 5.8.x." - http://spamassassin.1065346.n5.nabble.com/UTF-8-Spam-rules-td106485.html

$ spamassassin --version
SpamAssassin version 3.4.0-rsvnunknown

$ perl --version
This is perl, v5.10.1 (*) built for i486-linux-gnu-thread-multi

spamassassin --lint has nothing to say.

This properly prints a euro sign:

$ perl -Mcharnames=:full -CS -wle 'print "\N{EURO SIGN}"'
€

But "spamassassin -t" says the rules with non-ASCII UTF-8 characters aren't hitting. What am I missing?

If anyone happens upon this email trying to get UTF-8 stuff straightened out, to get gnome-terminal to work I needed to add:

$ cat .gnomerc
export LANG="en_US.utf8"

To get Apache to work I needed:

AddDefaultCharset utf-8

The rest is covered here: http://perlgeek.de/en/article/set-up-a-clean-utf8-environment